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(57) Abstract 

Described aie lecombinant DNA molecules comprising the regulatory scquence(s) of an intnon of the Endothelial Growth Factor 
(VEGF) itceptor-2 gene (Flk-1) or of a gene homologous to the Flk-1 gene, being capable of conferring expression of a heterologous 
DNA sequence in endothelial cells, preferably in vivo. VectOTs comprising said DNA molecules as well as host cells containing the 
same are provided. Also provided aiB phannaceutical and diagnostic conditions comprising such recombinant DNA molecules and 
vectors. Furthermore, cells and transgenic non-human animals, comprising the aforementioned recombinant DNA molecules or vectors 
stably integrated into their genome and their use for the identification of substances capable of supi^essing or activating transcription of 
a gene in endothelial cells are described. Described is furtiier the use of the before described recombinant DNA molecules and vectors 
for the preparation of pharmaceutical compositions for treatmg, preventing, and/or delaying a vascular or tumorous disease in a subject. 
Furthermore, uses of the recombinant DNA molecules and vectors of the invention for the preparation of pharmaceutical compositions for 
inducing a vascular or tumorous disease in a non-human animal are provided. 
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REGULATORY SEQUENCES CAPABLE OF CONFERRING EXPRESSION OF A 
HETEROLOGOUS DNA SEQUENCE IN ENDOTHELIAL CELLS IN VIVO AND 
USES THEREOF 



The present invention relates to recombinant DNA molecules comprising the 
regulatory sequence(s) of an intron of the Vascular Endothelial Growth Factor 
(VEGF) receptor-2 gene (Flk-1) or of a gene homologous to the Flk-1 gene, being 
capable of conferring expression of a heterologous DNA sequence in endothelial 
cells in vivo. The present invention also relates to vectors comprising said 
recombinant DNA molecules as well as to host cells transformed with such 
recombinant DNA molecules or vectors. The present invention additionally relates to 
pharmaceutical and diagnostic compositions comprising such recombinant DNA 
molecules, vectors or cells. Furthermore, the present invention relates to cells and 
transgenic non-human animals, comprising the aforementioned recombinant DNA 
molecules or vectors stably integrated into their genome and their use for the 
identification of substances capable of suppressing or activating transcription of a 
gene in endothelial cells. The present invention also relates to the use of the before 
described recombinant DNA molecules and vectors for the preparation of 
pharmaceutical compositions for treating, preventing, and/or delaying a vascular or 
tumorous disease in a subject. Furthermore, ttie recombinant DNA molecules and 
vectors of the invention can be used for the preparation of phamiaceutical 
compositions for inducing a vascular or tumorous disease in a non-human animal. 

In tiie field of neuroscience and medical therapy, there is a great demand for test 
systems to study the function and interaction of gene products, th malfunction or 
expression of which cause vascular and/or tumorous diseases. Such systems would 
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also be suitable for drug development against such diseases. A prominent example 
for gene products involved in vascular diseases are angiogenic growth factors and 
their endothelial receptors which play a major role in the formation of the embryonic 
vascular system and in certain angiogenesis-dependent diseases, such as solid 
tumor gn3Wth or retinopathy. The ftinase-insert Domain-containing Receptor/fetal 
liver kinase-1 (KDR/Flk-1) in the following referred to as Flk-1 and Flt-1 are high 
affinity signaling receptors for the endothelial mitogen, vascular endothelial growth 
factor (VEGF) (Connolly, J. Clin. Invest. 84 (1989), 1470-1478; Leung, Science 246 
(1989), 1306-1309). Through interactions with its receptors, VEGF plays critical roles 
in growth and maintenance of vascular endothelial cells and in the development of 
new blood vessels in physiologic and pathologic states (Aiello, New Engl. J. Med. 
331 (1994). 1480-1487; Shweiki, Nature 359 (1992). 843-845; Berkman, J. Clin. 
Invest 91 (1993). 153-159). The pattems of embryonic expression of VEGF suggest 
that it is crucial for differentiation of endothelial cells from hemangioblasts and for 
development of blood vessels at all stages of growth (Jakeman. Endocrinology 133 
(1993), 848-859; Breier. Development 114 (1992), 521-532). Among many 
potentially angiogenic factors. VEGF is the only one with pattems of expression, 
secretion, and activity that suggest a specific angiogenic function in normal 
development (Klagsbrun, Cun^nt Biology 3 (1993), 699-702). 
High-affinity receptors for VEGF are found only on endothelial cells, and VEGF 
binding has been demonstrated on macro- and microvascular endolhelial cells and 
in quiescent and proliferating endothelial cells (Jakeman, Endocrinology 133 (1993). 
848^59; Jakeman. Clin. Invest. 89 (1992), 244-253). The Flk-1 and Flt-1 have been 
Identified as candidate VEGF receptors by affinity cross-linking and competition- 
binding assays (de Vries. Science 255 (1992). 989-991; Millauer. Cell 72 (1993). 
835-846; Tenman, Biochem. Biophys. Res. Commun. 187 (1992), 1579-1586). 
These two receptor tyrosine kinases contain seven similar extracellular 
immunoglobulin domains and a conserved intracellular tyrosine kinase domain 
interrupted by a kinase insert (de Vries, Science 255 (1992). 989-991; Matthews. 
Proc. Natl, Acad, Sci. U,S.A 88 (1991), 9026-9030; Ternian, Oncogene 6 (1991), 
1677-1683); they are expressed specifically by endothelial cells in vivo (Millauer. 
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Cell 72 (1993). 835-846; Peters, Proc. Natl. Acad. Sci. USA 90 (1993), 7533-7537; 
Yamaguchi, Development 118 (1993), 489-498). In situ hybridization in the 
developing mouse has demonstrated that Flk-1 is expressed in endothelial cells at 
all stages of development, as well as in the blood island in which endothelial cell 
precursors first appear (MHIauer. Cell 72 (1993), 835-846). Flk-1 is a marker for 
endothelial cell precursors at their earliest stages of development (Yamaguchi. 
Development 118 (1993), 489-498). 

The vascular endothelium is critical for physiologic responses including thrombosis 
and throml)olysis, lymphocyte and macrophage homing, modulation of the immune 
response, and regulation of vascular tone. The endothelium is also intimately 
involved in the pathogenesis of vascular diseases such as atherosclerosis (Ross, 
Nature 362 (1993). 801-809). Although a number of genes expressed in the 
endothelium have been characterized (Collins, J. BioL Chem. 266 (1991). 2466- 
2473; lademarco, J. BioL Chem. 267 (1992). 16323-16329; Jahroudi, Mol. Cell. Biol, 
14 (1994), 999-1008; Lee, J. Biol. Chem. 265 (1990). 10446-10450), expression of 
these genes is either not limited to vascular endothelium (e.g.. the genes encoding 
von Willebrand factor, endothelin-1, vascular cell adhesion molecule-1). or is 
restricted to specific subpopulations of endothelial cells (e.g., the gene for 
endothelial-leukocyte adhesion molecule-1). Flk-1 (also known as VEGF-receptor 2) 
is expressed in endothelial cells during embryonic and postnatal development. The 
Flk-1 receptor is the first endothelial receptor to be expressed in endothelial cell 
precursors during embryonic vascular development. Gene targeting experiments in 
transgenic mice have demonstrated that this receptor is essential for endothelial cell 
differentiation (Shalaby. Nature 376 (1995), 62-66). Furthemiore; in a variety of 
tumors, Flk-1 receptor expression is re-induced in the tumor vasculature, and it has 
been shown that signaling via the Flk-1 receptor is required for tumor vascularization 
and growth (Millauer. Nature 367 (1994). 576-579). 

Recently, in vitro studies with the upstream region of the human Flk-1 gene 
(Pattereon. J. Biol. Chem. (1995). 23111-23118) showed that DNA fragments 
located in the 5' flanking region of the hunran Flk-1 gene mediate xpression of a 
reporter gene. 
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neuronal cells, including neural retinal progenitor cells at all or different stages of 
development and haematopoietic cells (Yang, J. Neuroscl. 16 (1996), 6089-6099). 

The term "further regulatory sequences" refers to sequences which influence the 
specificity and/or level of expressiori; for example in the sense that they confer cell 
and/or tissue specificity or developmentally and/or inducible regulated gene 
expression. Such regions can be located upstream of or comprising the transcription 
initiation site, such as a promoter, but can also be located downstream of it, e.g., in 
transcribed but not translated leader sequences. 

The term "promoter" refers to the nucleotide sequences necessary for transcription 
initiation, i.e. RNA polymerase binding, and also includes, for example, the TATA 
box. 

The term "in vivo" for the purpose of the present invention is used for cells in an 
organism as opposed to cells growing in culture (in vitro). 

The term "heterologous" with respect to the DNA sequence being operatively linked 
to the promoter of the invention means that said DNA sequence is not naturally 
linked to the regulatory sequences comprised in the recombinant DNA molecule of 
the invention. 

In a prefenred embodiment said first regulatory sequence of the invention comprises 
a GATA-binding site, an AP-1 binding site, an SP1 binding site. site, an NFkB 
binding site, a STAT binding site, a Scl/Tal-1 binding site, an Ets-1 binding site, a 
PEA3 consensus sequence or any combination(s) thereof. A functional analysis of 
the first 6.5 kbp of the transcribed region of the murine Ftk-1 genes lead to the 
identification of a endothelial-specific positive regulatory element. This regulatory 
sequence is located in the region between the Xhol and BamHI restriction site in the 
first intron of the Flk-1 gene (cf. Fig. 4A). It is functional in both orientations since the 
intron enhancer was used in an antiparallel nrranner with respect to the Flk-1 
promoter fragment in the construct refen^ed to as 3*-ln 1; see Example 2 
hereinbelow. A sequence analysis of the intron lead to the identification of two 
potential GATA binding sites (+1927 Bp, +3514 Bp); a potential AP-1 binding site 
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(+2210 Bp) and two PEA3 consensus sequences (+3494 Bp, +3741 Bp); see Fig. 1. 
As demonstrated in Example 8, the intron sequences that were sufficient for 
endothelium-spectfic expression were contained in a 510 bp fragment (nucleotides 
10094 to 10608 of SEQ ID NO: 1). Several potential binding sites for known 
transcription factors could be identified therein (see Figure 12), including consensus 
binding sites for c-ets1, PEAS (an Ets-like transcription factor), GATA transcription 
factors, and Scl/Tal-1. The c-ets1 transcription factor was proposed to be involved in 
the eariy differentiation of endothelial cells from their precursors (Pardanaut Cell 
Adhesion and Communication 1 (1993), 151-160). In addition, c-ets1 is expressed in 
endothelial cells during tumor vascularization and other forms of angiogenesis in 
humans (Wernert, Am. J. Pathol. 140 (1992), 119-127). Proteins of the Ets family 
can activate transcription through a PEA3 motif (Wemert, 1992). Transcription 
factors of the GATA family are involved in the transcription of genes that are 
expressed in the hematopoietic and endothelial lineages, such as von Willebrand 
factor (Jahroudi, MoL Cell. BioL 14 (1994), 999-1008). Unlike the hematopoietic- 
transcription factor GATA-1, GATA-2 is expressed in both the endothelial and 
hematopoietic lineages (Elefanty, Blood 90 (1997), 1435-1447). Scin"al-1 has 
recently been implicated in the regulation of Flk-1 expression in Zebrafish (Liao, 
Genes Dev. 12 (1998). 621-626). The presence of two potential Scl/Tal-I binding 
sites in the murine Flk-1 intron enhancer suggests that Scl/Tal-1 might regulate Flk-I 
expression in mice. However, no direct effect of Scl/Tal-1 on F//c-^expression has 
been observed so far in mice, although Scl-null mice have vascular defects 
(Visvader, Genes Dev. 12 (1998), 473-479). Knock out experiments performed with 
the above-described regulatory sequences will easily reveal which of these elements 
present in. e.g., the 510 bp fragment (nucleotides 10094 to 10608 of SEQ ID NO; 1 ) 
are involved in the control of the regulatory sequence and the sequential order of 
these elements necessary to confer endothelium specific gene expression. Of 
course, the regulatory sequences obtained from such studies are also within the 
scope of the present invention. 



Preferablyi said first regulatory sequence is selected from the group consisting of 
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(a) DNA sequences comprising a nucleotide sequence as given in SEQ ID NO: 1 ; 

(b) DNA sequences comprising a nucleotide sequence of SEQ ID NO: 1 from 
nucleotide 8260 to nucleotide 10560, from nucleotide 8336 to nucleotide 10608 
and/or from nucleotide 10094 to nucleotide 10608; 

(c) DNA sequences comprising ffie nucleotide sequence of the human Flk-1- 
intron; 

(d) DNA sequences comprising a nucleotide sequence which hybridizes with a 
nucleotide sequence of (a), (b) or (c) under stringent conditions; 

(e) DNA sequences comprising a nucleotide sequence which is conserved in the 
nucleotide sequences of (a), (b) and (c); and 

(f) DNA sequences comprising a fragment, analogue or derivative of a nucleotide 
sequence of any one of (a) to (e) capable of confening expression in 
endothelial cells. 

In a particularly prefen^d embodiment of the present Invention, the regulatory 
sequences comprise the nucleotides 8260 to 10560, 8336 to 10608 (comprising the 
BamHI/Xhoi fragment of the first intron (+1677 bp/+3947); see Figure 4 and 
Examples 1 to 10), most preferably nucleotides 8560 to 10400 and still more 
preferably nucleotides 10094 to 10608 (comprising the Swal/BamHI fragment 
(+3437 bp/3947 bp); see Example 8) of the nucleotide sequence as set forth in SEQ 
ID No. 1 or a fragment thereof, which still confers expression in endothelial cells, 
preferably at all stages of development. 

In a further prefenred embodiment of the invention the heterologous DNA sequence 
of the recombinant DNA molecules described above Is operatively linked to further 
regulatory sequences. Expression comprises transcription of the nucleic acid 
molecule, preferably into a translatable mRNA. Regulatory elements ensuring 
expression in eukaryotic cells, preferably mammalian cells, are well known to those 
skilled in the art. They nonnally comprise promoters ensuring initiation of 
transcription and optionally poly-A signals ensuring termination of transcription and 
stabilization of the transcript. Additional regulatory elements may include 



W 0009855 63S [htl;i:)://wy.v.:,g^^^^ _. . Pag e 14of U 

WO 98/55638 PCT/EP98/03318 

12 

transcriptional as well as translational enhancers. Preferably said further regulatory 
sequence is a promoter and/or a 3'-untranslated region. 

Although some endothelial-specific promoters have been characterized, e.g. of the 
genes for von Willebrand factor (Jahroudi, MoL Cell Biol. 14 (1994), 999-1008). 
Endothelin-1 (Lee, J. BioL Chem. 265 (1990), 10446-10450), E-selectin (Collins. J. 
Biol. Chem. 266 (1991), 2466-2473), Tie-2 (Schlaeger, Development 121 (1995), 
1089-1098), VCAM-1 (lademarco, J. BioL Chem. 267 (1992), 16323-16329) and 
endothelial NO-synthase (Zhang, J. BioL Chem. 270 (1995), 15320-15326) these 
genes are neither specific for proliferating endothelium, nor necessary for endothelial 
cell detemiination. Due to the present invention these promoters can now be 
combined with the regulatory sequences of the invention in order to mediate 
endothelium specific gene expression of heterologous DMA sequences. However, 
other promoters can be used as well. For example, it is shown in Example 8 that the 
regulatory sequences of the invention conferred endothelium-specific gene 
expression to the heterologous herpes simplex virus-thymidine kinase (tk) promoter. 

In a preferred embodiment the above mentioned promoter is a promoter of hypoxia 
inducible genes, genes encoding growth factors such as VEGF, PDGF or Fibroblast 
growrth factor or their receptors or glycolytic enzymes. 

In a particularly prefen^ed embodiment said promoter comprises a ONA sequence 
selected from the group consisting of 

(a) DNA sequences comprising the nucleotide sequence as given in SEQ ID NO:1 
from nucleotide 6036 to nucleotide 6959; 

(b) DNA sequences comprising the nucleotide sequence of the human Flk-1 
promoter; 

(c) DNA sequences comprising a nucleotide sequence which hybridizes with a 
nucleotide sequence of (a) or (b) under stringent conditions; 

(d) DNA sequences comprising a nucleotide sequence which is conserved In the 
nucleotide sequences of (a) and (b); and 
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(e) DNA sequences comprising a fragment, analogue or derivative of a nucleotide 
sequence of any one of (a) to (d). 

At least one of the aforedescribed DNA sequences may be preferably of human or 
murine origin although other source's may be employed as well. Preferably, the 
heterologous DNA sequence being operatively linked to the regulatory sequences is 
located 5' to the regulatory sequence of the invention. 

In a further prefen-ed embodiment, the heterologous DNA sequence of the above- 
described recombinant DNA molecules encodes a peptide, protein, antisense RNA, 
sense RNA and/or rilxDzyme. The recombinant DNA molecule or vector of the 
invention can be used alone or as part of a vector to express heterologous DNA 
sequences, which, e.g., encode proteins other than Flk-1 , in cells of the blood vessel 
wall, i.e., endothelial cells, for, e.g., gene therapy or diagnostics of vascular diseases 
such as atherosclerosis. The recombinant DNA molecule or vector containing DNA 
sequence encoding a protein of interest is introduced into endothelial cells which in 
tum produce the protein of interest. For example, sequences encoding t-PA 
(Pennica. Nature 301 (1982), 214). p21 cell cycle inhibitor (El-Deiry. Cell 75 (1993). 
817-823), or nitric oxide synthase (Bredt. Nature 347 (1990), 768-770) may be 
operatively linked to the endothelial cell-specific regulatory sequences of the 
Invention and expressed in endothelial cells. For example, thrombolytic agents can 
be expressed under the control of the endothelial cell-specific regulatory sequences 
of the invention for expression by vascular endothelial cells in blood vessels, e.g., 
vessels occluded by aberrant blood clots. Other heterologous proteins, e.g., proteins 
which inhibit smooth muscle cell proliferation, e.g., interferon-7 and atrial natriuretic 
polypeptide, may be specifically expressed in endothelial cells to ensure the delivery 
of these therapeutic peptides to an atherosclerotic lesion or an area at risk of 
developing an atherosclerotic lesion, e.g., an injured blood vessel. 
The endothelial cell-specific regulatory sequences of the invention may also be used 
in gene th rapy to promote angiogenesis to treat diseas s such as peripheral 
vascular disease or coronary artery disease (Isner, Circulation 91 (1995), 2687- 
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2692). For example, the regulatory sequences of the invention can be operatively 
linked to sequences encoding cellular growth factors which promote angiogenesis, 
e.g., VEGF, acidic fibroblast growth factor, basic fibroblast growth factor and the like. 

In a most preferred emt>odiment of"The present invention, said protein is selected 
from the group consisting of Vascular Endothelial Growth Factor (VEGF), Hypoxia 
Inducible Factors (HIF), HIF-Related Factor (HRF). tissue plasminogen activator, 
p21 cell cycle inhibitor, nitric oxide synthase, interferon-y. atrial natriuretic 
polypeptide and monocyte chemotactic proteins. 

In another particulariy preferred embodiment of the invention, said protein is a 
scorable mariner preferably luciferase, green fluorescent protein or lacZ. This 
embodiment is particularly useful for simple and rapid screening methods for 
compounds and substances described herein below capable of modulating the 
expression of genes In the endothelium. For example, endothelial cells can be 
cultured with VEGF in the presence and absence of the candidate compound in 
order to determine whether the compound affects the expression of genes which are 
under the control of regulatory sequences of the invention, which can be measured, 
e.g.. by monitoring the expression of the above-mentioned mariner. It is also 
immediately evident to those skilled in the art that other marker genes may be 
employed as well, encoding, for example, selectable marker whichrprovide for the 
direct selection of compounds which induce or inhibit the expression of said marker. 

The regulatory sequences of the invention may also be used in methods of 
antisense therapy. Antisense therapy may be carried out by administering to an 
animal or a human patient, a recombinant DNA containing the endothelial cell- 
specific regulatory sequences of the invention operably linked to a DNA sequence, 
i.e.. an antisense template which is transcribed into an antisense RNA, The 
antisense RNA may be a short (generally at least 10. preferably at least 14 
nucleotides, and up to 100 or more nucleotides) nucleotide sequence fomnulated to 
be oomptem ntary to a portion of a specific mRNA sequence. Standard methods 
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relating to antisense technology have been described (Melani, Cancer Res. 51 
(1991), 2897-2901). Following transcription of the DNA sequence into antisense 
RNA, the antisense RNA binds to its target mRNA nralecules within a cell, thereby 
inhibiting translation of the mRNA and down-regulating expression of the protein 
encoded by the mRNA. For examptS. an antisense sequence complementary to a 
portion of or all of the Flk-1 (KDR) mRNA (Terman. Oncogene 6 (1991). 1677-1683 
and Terman (1992), supra) would inhibit the expression of Flk-1 , which in turn would 
inhibit angiogenesis. Such antisense therapy may be used to treat cancer, 
particularly to inhibit angiogenesis at the site of a solid tumor, as well as other 
pathogenic conditions which are caused by or exacerbated by angiogenesis, e.g., 
inflammatory diseases such as rheumatoid arthritis, and diabetic retinopathy. 
The expression of other endothelial cell proteins may also be inhibited in a similar 
manner, for example, endothelial cell proteins such as cell cycle proteins (thereby 
inhibiting endothelial cell proliferation, and therefore, angiogenesis); coagulation 
factors such as von Willebrand factor; and endothelial cell adhesion factors, such as 
ICAM-1 and VCAM-1 (Bennett, J. Immunol. 152 (1994), 3530-3540). 
Thus, in a further preferred embodiment of the present invention, said antisense 
RNA or said ribozyme are directed against a gene involved in vasculogenesis and/or 
angiogenesis and/or tumors of endothelial origin. 

In a further embodiment, the invention relates to nucleic acid molecules of at least 
15 nucleotides in length hybridizing specifically with a regulatory sequence as 
described above or with a complementary strand thereof. This means that they 
hybridize, preferably under stringent conditions, specifically with the nucleotide 
sequences as described above and show no or very little cross-hybridization with 
nucleotide sequences having no or substantially different regulatory properties. Such 
nucleic acid molecules may be used as probes and/or for the control of gene 
expression. Nucleic acid probe technology is well known to those skilled in the art 
who will readily appreciate that such prot)es may vary in length. Preferred are 
nucleic acid probes of 17. 18. 19. 20 to 25 and 25 to 35 nucleotides in length. Of 
course, it may also be appropriate to use nucleic acids of up to 100 and more 
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nucleotides in length. The nucleic acid probes of the invention are useful for various 
applications. On the one hand, they may be used as PGR primers for amplification of 
regulatory sequences according to the invention. Another application is the use as a 
hybridization probe to identify regulatory sequences hybridizing to the regulatory 
sequences of the invention by homology screening of genomic DNA libraries. 
Nucleic acid molecules according to this preferred emtx)diment of the invention 
which are complementary to a regulatory sequence as described above may also be 
used for repression of expression of a gene comprising such regulatory sequences, 
for example due to an antisense or triple helix effect or for the construction of 
appropriate ribozymes (see, e.g., EP-B1 0 291 533, EP-A1 0 321 201, EP.A2 0 360 
257) which specifically cleave the {pre)-mRNA of a gene comprising a regulatory 
sequence of the invention. Selection of appropriate target sites and corresponding 
ribozymes can be done as descril>ed for example in Steinecke. Ribozymes, Methods 
in Cell Biology 50, Galbraith et al. eds Academic Press, Inc. (1995), 449-460, 
Furthennore, the person sidlled in the art is well aware that it is also possible to label 
such a nucleic acid probe with an appropriate mariner for specific applications, such 
as for the detection of the presence of a nucleic acid molecule of the invention in a 
sample derived from an organism. 

Such molecules may either be DNA or RNA or a hybrid thereof. Furthermore, said 
nucleic acid molecule may contain, for example, thioester bonds and/or nucleotides 
analogues, commonly used in oligonucleotide anti-sense approaches. Said 
modifications may be useful for the stabilization of the nucleic acid molecule against 
endo- and/or exonucleases in the cell, Said nucleic acid molecules may also be 
transcribed by an appropriate vector containing a chimeric gene which allows for the 
transcription of said nucleic acid molecule in the cell. Such nucleic acid molecules 
may further contain ribozyme sequences which specifically cleave the (pre>-mRNA 
comprising the regulatory sequence of the invention. Furthermore, oligonucleotides 
can be designed which are complementary to a regulatory sequence of the Invention 
(triple helix; see Lee, Nucl. Acids Res. 6 (1979). 3073; Cooney, Science 241 (1988), 
456 and Dervan, Science 251 (1991). 1360), thereby preventing transcription and 
the production of the encod d protein. 
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The present invention also relates to vectors, particularly plasmids, cosmids, viruses, 
bacteriophages used conventionally in genetic engineering that comprise a 
recombinant DNA molecule of the invention. Preferably, said vector is an expression 
vector and/or a targeting vector. Expression vectors derived from vimses such as 
retroviruses, vaccinia vims, adeno^ssociated vims, herpes vimses, or bovine 
papilloma virus, may be used for delivery of the recombinant DNA molecule or 
vector of the invention into targeted cell population. Methods which are well known 
to those skilled in the art can be used to constaict recombinant viral vectors; see, for 
example, the techniques described in Sambrook, Molecular Cloning A Laboratory 
Manual, Cold Spring Hart)or Laboratory (1989) N.Y. and Ausubel, Current Protocols 
in Molecular Biology, Green Publishing Associates and Wiley Interscience. N.Y. 
(1989). Altematively, the recombinant DNA molecules and vectors of the invention 
can be reconstituted into liposomes for delivery to target cells. 
As is demonstrated in Example 11, the Flk-1 promoter was stimulated by HIF-2a, a 
basic helix-loop-helix/PAS domain transcription factor related to hypoxia-inducible 
factor-1. HIF-2a has previously been shown to stimulate the expression of VEGF, 
suggesting that HIF-2a may regulate the coordinate expression of both the VEGF 
receptor Flk-1 and its ligand in vivo. Thus, Flk-1 gene regulatory elements described 
herein can be used together with HIF-2a for the elucidation of the molecular 
mechanisms involved in endothelial cell specification and angiogenesis, and can be 
used to target expression of any transgene to the endothelium. Thus, in a prefen^ed 
embodiment, the vector of the invention furthers comprises a gene capable of 
expressing HIF-2a. 

The present invention furthemriore relates to host cells transfonned with a DNA 
molecule or vector of the invention. Said host cell may be a prokaryotic or eukaryotic 
cell. The vector or recombinant DNA molecule of the invention which is present in 
the host cell may either be integrated into the genome of the host cell or it may be 
maintained extrachromosomally. In this respect, it is also to be understood that the 
recombinant DNA molecule of the invention can be used for "g ne targeting" and/or 
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"gene replacemenf, for restoring a mutant gene or for creating a mutant gene via 
homologous recombination. 

The host cell can be any prokaryotic or eukaryotic cell, such as a bacterial, insect, 
fungal, plant or animal cell. Preferred fungal cells are, for example, those of the 
genus Saccharomyces, in particuFaT those of the species S. cerevisiae. Suitable 
mammalian cell lines comprise Saos-2 human osteosarcoma cells (ATCC HTB-85), 
HeLa human epidermoid carcinoma cells (ATCC CRL-7923), HepG2 human 
hepatoma cells (ATCC HB-8065), human fibroblasts (ATCC CRL-1634), U937 
human histiocytic lymphoma cells (ATCC CRL-7939), RD human embryonal 
rhabdomyosarcoma cells (ATCC CCL-136), MCF7 human breast adenocarcinoma 
cells (ATCC HTB-22), JEG-3 human choriocarcinoma cells (ATCC HB36). A7r5 fetal 
rat aortic smooth muscle cells (ATCC CRL-1444). and NIH 3T3 mouse fibroblasts 
(ATCC CRL-1658) obtainable from the American Type Culture Collection. Primary- 
culture HUVEC may be obtained from Clonetlcs Corp. (San Diego, CA) and can be 
grown in EGM medium containing 2% fetal calf serum (Clonetics). Primary-culture 
human aortic and intestinal smooth muscle cells can also be obtained from Clonetics 
Corp. Most preferably said host ceil is an endothelial cell or derived therefn^m, such 
as BAE cells. In view of the synergistic effect of the co-expression of a recombinant 
DNA molecule of the invention and HIF-2a, a further emtx)diment of the invention 
concerns the above-described cells which further comprise a recombinant DNA 
molecule or vector containing a gene capable of expressing HlF-2a. ' 

Moreover, the present invention relates to a pharmaceutical composition comprising 
at least one of the aforementioned recombinant DNA molecules or vectors of the 
invention, either alone or in combination, and optionally a phamiaceutically 
acceptable carrier or excipient. Examples of suitable pharmaceutical earners are well 
known in the art and include phosphate buffered saline solutions, water, emulsions, 
such as oil/water emulsions, various types of wetting agents, sterile solutions etc. 
Compositions comprising such earners can be fonnulated by well known 
conventional methods. These pharmaceutical compositions can be administered to 
the subject at a suitable dose. Administration of the suitable compositions may be 



PCL XL error 

Subsystem: IMAGE 

Error: ExtraData 

Operator: Readlmage 

Position: 8526 



WO 98/55638 PCT/EP98/03318 



acids to a specific site in the body for gene therapy or antisense therapy may also 
be accomplished using a biolistic delivery system, such as that described by 
Williams (Proc, Natl. Acad. Sci. USA 88 (1991), 2726-2729). 

Standard methods for transfecting cells with recombinant DNA are well known to 
those skilled in the art of molecular "Biology, see, e.g.. WO 94/29469. Gene therapy 
and antisense therapy to prevent or decrease the development of atherosclerosis or 
inhibit angiogenesis may be earned out by directly administering the recombinant 
DNA molecule or vector of the invention to a patient or by transfecting endothelial 
cells with the recombinant DNA molecule or vector of the invention ex vivo and 
infusing the transfected cells into the patient. Furthemnore, research pertaining to 
gene transfer into cells of the germ line is one of the fastest growing fields in 

genes into cells by ex-vivo or in-vivo techniques is one of the most important 
applications of gene transfer. Suitable vectors and methods for in-vitro or in-vivo 
gene therapy are described in the literature and are known to the person skilled in 
the art; see, e.g.. W094/29469, WO 97/00957 or Schaper (Cun^ent Opinion in 
Biotechnology 7 (1996), 635-640) and references cited therein. The DNA nnolecules 
and vectors comprised in the pharmaceutical composition of the invention may be 
designed for direct introduction or for introduction via liposomes, or viral vectors (e.g. 
adenoviral, retroviral) containing said recombinant DNA molecule into the cell. 
Preferably, said cell is a gemi line cell, embryonic cell, or egg^cell or derived 
therefrom. The phannaceutical compositions according to the invention can be used 
for the treatment of all kinds of diseases hitherto unknown as being related to the 
expression and/or over expression of genes in the endothelium. 

The present invention also relates to diagnostic compositions or kits comprising at 
least one of the aforementioned recombinant DNA molecules, vectors, cells and/or 
nucleic acid molecules and, in the case of diagnostic compositions, optionally 
suitable means for detection. 

Said diagnostic compositions may be used for methods of detecting and isolating 
regulatory sequences which are a functionally equivalent to the Flk-1 Intron 
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regulatory sequences of the invention. The kits of the invention may further contain 
compounds such as further plasmids, antibiotics and the like for screening 
transgenic animals and/or animal cells useful for the genetic engineering of non- 
human animals, preferably mammals and most preferably mouse. 

It is to be understood that the introduced recombinant DNA molecules and vectors of 
the invention express the heterologous DNA sequence after introduction into said 
cell and preferably remain in this status during the lifetime of said cell. For example, 
cell lines which stably express the heterologous DNA under the control of the 
regulatory sequence of the invention may be engineered. Rather than using 
expression vectors Nwhich contain viral origins of replication, host cells can be 
transformed with the recombinant DNA molecule or vector of the invention and a 
selectable marker, either on the same or separate vectors. Following the introduction 
of foreign DNA, engineered cells may be allowed to grow for 1-2 days in an enriched 
media, and then are switched to a selective media. The selectable mart<er in the 
recombinant plasmid confers resistance to the selection and allows cells to stably 
integrate the plasmid into their chromosomes and grow to form foci which In tum can 
be cloned and expanded into cell lines. This method may advantageously be used to 
engineer cell lines which express the heterologous DNA sequence under the control 
of the regulatory sequence of the invention, and which respond to VEGF and/or 
hypoxia mediated signal transduction. Such engineered cell lines are particulariy 
useful in screening compounds capable of modulating Flk-1 gene expression. 
A number of selection systems may be used, including but not limited to the herpes 
simplex virus thymidine kinase (Wigler, Cell 11(1977), 223), hypoxanthine-guanine 
phosphoribosyltransferase {Szybalska, Proc. Natl. Acad. Sci. USA 48 (1962), 2026), 
and adenine phosphoribosyltransferase (Lowy, Cell 22 (1980), 817) genes can be 
employed in tk", hgprt: or aprt' cells, respectively. Also, antimetabolite resistance can 
be used as the basis of selection for dhfr, which confers resistance to methotrexate 
(Wigler. Proc. Natl. Acad. Sci. USA 77 (1980), 3567; O'Hare, Proc. Natl. Acad. Sci. 
USA 78 (1981), 1527), gpt, which confers resistance to mycophenolic acid (Mulligan, 
Proc. Natl. Acad. Sci. USA 78 (1981), 2072); neo, which confers resistanc to the 
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aminoglycx)side G-418 (Colberre-Garapin, J. Mol. Biol. 150 (1981), 1); and hygro. 
which confers resistance to hygromycin (Santerre, Gene 30 (1984), 147) genes. 
Additional selectable genes have been described, namely trpB, which allows cells to 
utilize indole in place of tryptophan; hisD, which allows cells to utilize histinol in place 
of histidine (Hartman, Proc. Natl ; Acad. Sci. USA 85 (1988), 8047); and ODC 
(ornithine decarboxylase) which confers resistance to the ornithine decarboxylase 
inhibitor. 2-(difluoromethylhDL-omithine, DFMO (McConlogue, 1987, In: Current 
Communications in Molecular Biology. Cold Spring HariDor Laboratory ed.). On the 
other hand, the person skilled in the art may also use the regulatory sequences of 
the invention to "knock out" an endogenous gene comprising identical or similar 
regulatory sequences, for example, by gene targeting, cosuppression, triple helix, 
antisense or ribozyme technology. 

The present invention also relates to a method for the production of a transgenic 
animal, preferably mouse, comprising introduction of a recombinant DNA molecule 
or vector of the invention into a germ cell, an embryonic cell or an egg or a cell 
derived therefrom. The non-human animal to be used in the method of the invention 
may be a wildtype, i.e. healthy animal, or may have a disease or disorder, preferably 
a disease or disonjer which is dependent on neovascularization, such as solid 
tumors, retinopathy, arthritis, psoriasis. Said disease or disorder may be an inbom 
insufficiency or natural developed or caused by genetical engineering, for instance 
by the expression of a DNA sequence encoding a protein involved in neuronal 
development and/or diseases as described above, preferably under the control of 
the regulatory sequence of the invention. 

The invention also relates to transgenic non-human animals comprising a 
recombinant DNA molecule or vector of the invention or obtained by the method 
described above, preferably wherein said recombinant DNA molecule is stably 
integrated Into the genome of said non-human animal, preferably such that the 
presence of said recombinant DNA mol cule or vector leads to the transcription 
and/or expression of the heterologous DNA sequence by the regulatory sequence of 
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the invention. Further non-human animals which may be employed according to the 
embodiments of the invention as described above are well known to the person 
skilled in the art and comprise rat hamster, dog, monkey, rabbit, pig. 

With the regulatory sequences of thelnvention. it is now possible to study in vivo the 
regulation of Flk-1 expression during angiogenesis. Furthermore, since VEGF and 
VEGF receptor genes seem to have different functions in different stages of 
development, it is now possible to detemnine domains of said proteins which may be 
important for their biological activity and/or for the regulation of their activity. In 
addition, it is now possible to in vivo study mutations which affect different functional 
or regulatory aspects of VEGF or its receptor or vector of the invention. 

Moreover, the present invention relates to a method for the identification of a 
chemical and/or biological substance capable of suppressing or activating and/or 
enhancing the transcription of a gene in endothelial cells comprising: 

(a) contacting a cell of the invention or the transgenic non-human animal of the 
invention either of which is capable of expressing the heterologous DNA 
sequence with a plurality of compounds; and 

(b) detennining those compounds which suppress or activate and/or enhance the 
expression of said heterologous DNA sequence. 

Said plurality of compounds may be comprised in, for example, samples, e.g. cell 
extracts from, e.g. plants, animals or microorganisms. Furthemnore, said compounds 
may be known in the art but hitherto not known to be capable of suppressing or 
activating and/or enhancing the transcription of a gene in endothelial cells. The 
plurality of compounds may be, e.g.. added to the culture medium or Injected into 
the animals. 

The tenn "plurality of compounds" in a method of the invention is to be understood 
as a plurality of substances which are either identical or not. If a sample containing a 
plurality of compounds is identified in the method of the invention, then it is either 
possible to isolate the compound finom the original sample identified as containing 
the compound capable of suppressing or activating and/or enhancing the 
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transcription of a gene in endothelial cells, or one can further subdivide the original 
sample, for example, if it consists of a plurality of different compounds, so as to 
reduce the number of different substances per sample and repeat the method with 
the subdivisions of the original sample. Depending on the complexity of the samples, 
this can be done several times, preferably until the sample identified according to the 
method of the invention only comprises a limited number of or only one 
substance(s). Preferably said sample comprises substances of similar chemical 
and/or physical properties, most preferably said substances are identical. 
Determining whether a compound is capable of suppressing or activating and/or 
enhancing the transcription of a gene in endothelial cells can be done, for example, 
in mice by monitoring reporter gene expression or by monitoring behavior of the 

tiarisgenic non-numan a aio ma •.iy^hmwh ^..^^^^^ - i 

compared to that of wild-type animals or compared to a transgenic non-human 
animal contacted with a compound which is either ioiown to be capable or incapable 
of suppressing or activating and/or enhancing the transcription of a gene in 
endothelial cells of said transgenic non-human animal of the invention. Furthermore, 
the person skilled in the art can monitor the physical behavior, or for example the 
movement of the above-described animals. Such methods are well known in the art. 
Such regulators of Flk-1 gene expression may be used in processes such as wound 
healing; in contrast, antagonists of expression may be used in the treatment of 
tumors that rely on vascularization for growth. Thus, the present in\7ention provides 
methods for identifying compounds which modulate VEGF receptor (e.g.. Flk-1 or 
Flt1) gene expression. Compounds found to downregulate expression of a VEGF 
receptor gene can be used in methods to inhibit angiogenesis. while compounds 
found to enhance Flk-1 or Flt1 expression can be used in methods to promote 
angiogenesis, for example, to promote wound healing (e.g.. healing of broken 
bones, bums, diabetic ulcers, and traumatic or surgical wounds) or to treat 
peripheral vascular disease, atherosclerosis, cerebral vascular disease, hypoxic 
tissue damage (e.g.. retinopathy, hypoxic damage to heart tissue), diabetic 
pathologies such as chronic skin lesions, or coronary vascular disease. These 
compounds can also be used to treat patients who hav , or have had. transient 
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ischemic attacks, vascular graft surgery, balloon angioplasty, frostbite, gangrene, or 
poor circulation. Compounds identified as having the desired effect (i.e., enhancing 
or inhibiting Flk-1 expression) can be tested further in appropriate models of 
endothelial cell grovrth and angiogenesis which are known to those skilled in the art. 
Given the therapeutic value of the'^compounds identified in accordance with the 
above-described method the present invention also relates to a method for the 
production of a pharmaceutical composition comprising the steps of the method of 
the invention and fonnulating the compound identified in step (b) in a 
pharmaceutically acceptable fomn. 

The therapeutic compounds identified using the method of the invention may be 
administered to a patient by any appropriate method for the particular compound, 
e.g.. orally, intravenously, parenterally, transdennally. transmucosally, or by surgery 
or implantation (e.g., with the compound being in the form of a solid or semi-solid 
biologically compatible and resorbable matrix) at or near the site where the effect of 
the compound is desired. For example, a salve or transdermal patch that can be 
directly applied to the skin so that a sufficient quantity of the compound is absoriaed 
to increase vascularization locally may be used. This method would apply most 
generally to wounds on the skin. Salves containing the compound can be applied 
topically to induce new blood vessel formation locally, thereby improving 
oxygenation of the area and hastening wound healing. Therapeutic doses are 
detemiined to be appropriate by one skilled in the art. 

Furthermore, klentification of transacting factors which interact with the regulatory 
sequences of the invention can form the basis for the development of novel 
therapeutics for modulating conditions associated with endothelial cell growth, such 
as angiogenesis, vascular disease, and wound healing. Identification of transacting 
factors is carried out using standard methods in the art (see, e.g., Sambrook, supra, 
and Ausubel, supra). To determine whether a protein binds to the regulatory 
sequences of the invention standard DNA footprinting and/or native gel-shift 
analyses can be carried out. In order to klentity the transacting factor which binds to 
the regulatory sequence of the Invention, the regulatory sequences can be used as 
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an affinity reagent in standard protein purification methods, or as a probe for 
screening an expression library. Once the transacting factor is identified, modulation 
of its binding to the regulatory sequence in the Flk-1 gene can be pursued, 
beginning with, for example, screening for inhibitors of transacting factor binding. 
Enhancement of Flk-1 expressiorT'in a patient and thus enhancement of 
angiogenesis, may be achieved by administration of the transacting factor, or the 
gene encoding it (e.g., in a vector for gene therapy). In addition, if the active fomn of 
the transacting factor is a dimer, dominant-negative mutants of the transacting factor 
could be made in order to inhibit its activity. Furthennore, upon identification of the 
transacting factor, further components in the pathway of Flk-1 signal transduction 
can be identified. Modulation of the activities of these components can then be 
pursued, in order to develop additional dmgs and methods for modulating 
endothelial cell growth and anglogenesis. 

As discussed in the background section of the description of the present invention, 
the interaction of VEGF and its receptor play an important role in the onset of 
angiogenic disease. Transgenic non-human animals expressing VEGF and/or its 
receptor gene and/or mutated versions thereof under the control of the regulatory 
sequences of the invention can now be used for the identification of substances, 
which, for example, are capable of restoring the wild-type interaction of mutated 
VEGF and its receptor either or both of which bear mutations.^Some genetic 
changes lead to altered protein confomnational states. Genetic changes may 
therefore result in a decreased binding activity of VEGF. Restoring the activity of 
mutant VEGF protein or increasing the activity of other proteins which interact with 
mutant VEGF pnateins is the most elegant and specific means to conrect these 
molecular defects. In addition, some genetic changes may result in altered 
conformational states of the receptor. This, in turn, may functionally inactivate the 
tyrosine kinase activity, making it Incapable of signal transduction, in order to restore 
the function of such mutant proteins an antibody may be used which binds to an 
epitope and Induces a confomnational change of the protein thereby restoring the 
wild type function. Thus, the methods of the invention are also useful to screen e.g.. 
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antibody, Fab, Fv or scFv expression libraries wherein the DNA sequence encoding 
said antibodies or derivatives thereof are under the control of the regulatory 
sequence of the invention. It is, of course, evident to the person skilled in the art that 
also other protein or peptide expression libraries using the regulatory sequences of 
the invention may be employed. ' 

Further, the present invention relates to the use of the recombinant DNA molecule, 
vector, cell, phannaceutical compositions, diagnostic compositions or a transgenic 
non-human animal of the invention for the identification of a chemical and/or 
biological substance capable of suppressing or activating and/or enhancing the 
transcription, expression and/or activity of genes and/or its expression products in 
endothelial cells. 

In a preferred embodiment the chemical or biological substance used in the 
methods and uses of the present invention is selected from the group consisting of 
peptides, proteins, nucleic acids, antibodies, small organic compounds, honmones, 
neural transmitters, peptidomimics, and PNAs (Milner, Nature Medicine 1 (1995), 
879-880; Hupp, Cell 83 (1995). 237-245; Gibbs. Cell 79 (1994), 193-198). 

The present invention further relates to a method of inhibiting a vascular disease in a 
subject, comprising contacting an artery of said subject with the recombinant DNA 
molecule or vector of the invention, wherein said protein reduces or prevents the 
development of the vascular disease, preferably said protein reduces proliferation of 
smooth muscle cells, 

In a further embodiment the present invention relates to the use of a recombinant 
DNA molecule, vector, nucleic add molecule of the invention and/or substance 
Identified by a method of the invention for the preparation of a composition for 
directing and/or preventing expression of genes specifically in endothelial cells 
and/or for the preparation of a phamnaceutical composition for treating, preventing 
and/or delaying a vascular disease and/or a tumorous dis ase in a subject. The 
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upregulation and activation of the Fll<-1 receptor In peri-tumoral endothelial cells is 
believed to be involved in the neovascularization of various human or experimental 
tumors (Plate, 1994; Ferrara. Cun-. Opin. Nephrol. Hypertens. 5 (1996), 35-44). This 
hypothesis is supported by experiments in which the inhibition of Flk-1 -mediated 
signal transduction strongly inhibits tomor angiogenesis and tumor growth (Millauer, 
Nature 367 (1994), 576-579; Millauer, Cancer Res, 56 (1996). 1615-1620). Thus, by 
using compounds of the present invention described above capable of inhibiting Flk- 
1 gene expression, it Is possible to ameliorate tumorous diseases which depend on 
the expression of the FLK-1 gene. 

In a further embodiment, the present invention relates to the use of a recombinant 
DNA molecule, vector and/or the nucleic acid molecule of the invention for the 
preparation of a pharmaceutical composition for inducing a vascular disease in a 
non-human animal or in a transgenic non-human animal described above. 

In a prefen-ed embodiment of the methods and uses of the invention, the vascular 
disease Is atherosclerosis and/or a neuronal disorder. Further possible methods and 
uses in accordance with the present invention will be evident to the person skilled in 
the art and are described in, for example. WO 95/13387. WO 94/11499 and 
WO 97/00957. 

The recombinant DNA molecules, vectors, nucleic ackl molecules, compounds, uses 
and methods of the invention can be used for the treatment of all kinds of disorders 
and diseases hitherto unknown as being related to or dependent on the modulation 
of genes specifically expressed in the endothelium. The recombinant DNA 
molecules, vectors, nucleic acid molecules, compounds, methods and uses of the 
present invention may be desirably employed in humans, although animal treatment 
is also encompassed by the methods and uses described herein. Thus, the present 
invention provides for the use of a regulatory sequence as defined above for 
enhancing and/or directing gene expression in endothelial cells in any kind of 
organism. 
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first 6.2 kbp of the transcribed region (cf. Fig. 4A). The staining was 
carried out as described in Fig. 5. 

Figure 8: In vivo characterization of the intron enhancer in combination with the 
strongest promoter fragment. All three embryos cany -the fi- 
galactosidase gene under the control of the Flk-1 promoter fragment 
of bp -640 to bp +299 and the intron enhancer. The staining was 
carried out as descril>ed in Fig. 5. 



Figure 9: Detailed analysis of the left-hand embryo from Fig. 8A) Left lateral 
view. B) Sectional magnification of A. C) Right lateral view. 

Figure 10: Histological evaluation of the embryo depicted in Fig. 9. The embryo 
was embedded in paraffin and was cut into 10 pm slices. The cuts 
were stained with neutral red. A) Pseudo transversal cut through the 
head region. B) Magnification from a similar cut level as In A. C) 
Pseudo transversal cut from a more caudally located section. D) 
Pseudo transversal cut from thoracal section. 1: 4th ventricle 
cerebmm, 2: 3rd ventricle cerebmm, 3; endbrain vesicle. 4: A. carotis 
intema, 5: ganglion trigeminale (V), 6: V. candinalis anterior, 7: neural 
tube. 8: esophagus, 9: V. candinalis posterior, 10: aorta dorsalis, 11: 
endocaixiium of the heart atrium, 12: vessels of the myocardium. 



Figure 11: Reporter gene analysis of Flk-1 gene regulatory elements in 
transgenic mouse embryos. The iacZ reporter gene was fused to 
regulatory elements derived from the mouse Flk-1 gene and tested for 
p-galactosklase expresston in transgenic mouse embryos. A) 10.5 
day transgenic mouse embryo expressing facZ under the control of a 
939 bp promoter fragment in combination with a 2.3 kbp Xhol/BamHI 
fragment of the first intron spanning the region from +1677 bp to 
+3947 bp of the Flk-1 gene. This embryo was derived from a foster 
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mother. Most if not all developing vascular structures show 
galactosidase expression, for example the endocardium of the heart, 
the dorsal aorta, intersomitic vessels or vessels of the developing 
brain. B) 11.5 day embryo of a transgenic mouse line that was 
established with the same constmct C) An 1 1 .5 day Flk-UlacZ knock- 
in embryo in which the lacZ gene is expressed frxDm the endogenous 
Flk-1 locus shows a highly similar staining. However, note the 
absence of p-galactosidase expression in small blood vessels of the 
yolk sac. D-F) Paraffin sections of the p-galactosidase stained embryo 
from (B) demonstrate p-galactosidase expression in the paired dorsal 
aortae (D), a venous vessel connected with the heart (E), and 
capillaries invading the neural tube (F). G) p-galactosidase expression 
in a transgenic embryo containing the tk promoter in combination with 
the 2.3 kbp Xhol/BamHI fragment of the Flk-1 first intron. H) p- 
galactosidase expression in a transgenic embryo containing a 
construct with Flk-I promoter sequences (-640 bp/+299 bp) in 
combination with the 510 bp Swal/BamHI fragment of the first intron 
spanning the region ftom +3437 bp to +3947 bp of the Flk-I gene. 
Bar D)-F)100 mM. 

Figure 12: Nucleotide sequence of the Flk-1 intron enhancer and putative 
transcription factor binding sites. Sequences matching known 
transcription factor binding sites are underlined. This sequence is 
deposit in the GeneBank database (accession number AF061804), 

Figure 13: Analysis of transgene expression during earty development and in 
newborn mice. The transgenic mouse line 2603 expresses /acZ under 
the control of the Flk-I promoter (-640 bp/+299 bp) in combination 
with the 2.3 kbp Flk-1 intronenhancer A) Frontal view on a whole 
mount p-galactosidase stained 7.8-day embryo. The arrow indicat s 
transgene expression in the extraembryonic mesoderm. B) and C) 
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Paraffin sections from the embryo shown in A demonstrate transgene 
expression in endothelial cells of the allantois (B) and the yolk sac 
(C). D-H, LacZ staining of spleen (D), kidney (E), lung (F), liver (G) 
and thymus (H) from a postnatal day 5 transgenic mouse. EM, 
extraembryonic meso"aerm. Bars, 25 ^iM (C). 100 \xM (B.D,E,RG.H). 

Figure 14: The 5'-UTR is required for expression of the Flk-1 gene in the yolk 
sac vasculature. Transgenic mouse embryos that contain a Flk-1 
promoter and 5'UTR (-640 bp/+299 bp) / enhancer (+1677 bp/+3947 
bp) reporter gene construct show a complete vascular expression in 
the yolk sac vasculature (A and B). In contrast, the yolk sac of Flk- 
1/lacZ knock-in embryos which lack part of the 5'UTR show 
expression only in large collecting vessels that connect with the 
embryo, but not in the smaller vessels (C). Bar. 500 fiM. 

Figure 15: HIF-2a stimulates Flk-I gene expression. A293 cells were co- 
transfected with a reporter gene construct containing Flk-I promoter 
sequences from bp -640 to bp +299 and with expression vectors 
encoding the murine HIF-1a and HIF-2a cDNAs, respectively. 
Relative promoter activities were detennined as described in 
Materials and Methods. The promoter activity *of the control 
transfection was arbitrarily set to 1 . 

The examples illustrate the invention. 

Example 1 : Cloning and construction of Flk-1 Intron/reporter gene vectors 

DNA Clones containing the 5' region of the mouse Flk-1 gene were isolated from a 
library prepared from 129/SvJ mice in X Dash II vector (Stratagene) (ROnicke, supra) 
or in X FIX II or obtained from the PI Library (Genome Systems, St. Louis). A 21 kb 
region of the mouse Flk-1 gene, contained in the DNA insertions of two X phages 6 
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and 16, including approximately 15 kb of 5' flanking sequences, axons 1. 2 and 3, 
and introns 1 and 2 was characterized by restriction enzyme mapping and Southem 
blot analysis. Lower DNA fragments of the phage clones were cloned into 
pBluescript vector DNA (Stratagene) and used for further characterization. 
Sequencing was perfonned using"' an automatic Sequencer (373A, Applied 
Biosystems). The nucleotide sequence of the Flk-1 intron enhancer is deposited in 
the Genbank database (accession number AF061804). The search for potential 
transcription factor binding sites was performed with the Matlnspector software 
(Quandt. (1995) Nucl. Acids Res. 23. 4878-4884). 

The DNA sequence (SEQ ID NO: 1) of a 12.8 kb region spanning from about 
position -6,660 kb (relative to the transcripttonal start site) to approximately position 
+6,135 kb (located in the third exon) was determined (Fig. 1). Figure 4A shows a 
schematic representation of the first 6.5 kbp of the transcribed region of the murine 
Flk-1 gene, Exons I. II and III are emphasized as hatched boxes. The first intron 
having a length of 3.6 kbp is subdivided into two regions (5Mn1 and 3'-ln1). The 
region ln-2 contains the entire second intron, the second exon, the 3' end of the first 
intron and part of the third exon. This subdivision into various intron fragments was 
maintained in the following analyses. The reporter gene constructs used were 
derived fn^m pGL2 basic vector (Promega) that contains a promoteriess luciferase 
gene. Luciferase reporter gene constmcts were generated for transfection of cells in 
vitro. For use in transgenic mice in vivo, plasmids were used in which the luciferase 
reporter gene was replaced by a lacZ reporter gene. 

In order to generate (luciferase) reporter gene constructs, Flk-1 promoter fragments 
were amplified by PGR and cloned into pGL2 (Promega) vector DNA 5' to the 
luciferase gene as described by ROnicke, supra; see also Figure 2. In short, the 
upstream primers used were -1900: 5'-GGG GTA CCG AAT TCT AAA TGG GGC 
GAT TAG C-3' (SEQ ID NO 2); -640: 5'-GTG GTA CCC AAA GAG TGA AGA GCA 
CTG.3XSEQ ID NO: 3); -624. 5'-TCG GTA GCG AGC GAG CGA GGA AGT TC-3' 
(SEQ ID NO: 4); the downstream primer was +299, 5'-TTG GTA AGG TTG GTG 
GAG GTG GGG GTG GG-3' (SEQ ID NO: 5). To g nerate the constaict ranging from 
-4100 to +299, a Hlndlll-EcoRI fragment of recombinant lambda phage 6 from P1 
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Library (Genome Systems, St. Louis) was Inserted into the plasmid ranging from 
-1900 to +299. Vectors that contained FII^-1 intron sequences in addition to promoter 
sequences were generated as follows: specific intron sequences were amplified by 
PGR from cloned Flk-1 genomic DNA and inserted downstream of the reporter gene. 
Primers used for amplification were'S'-lnldown: 5'-AGG GAT CCA CTC TFT AGT 
AGT AAG GCG-3' (nucleotides 7036-7057 of SEQ ID NO: 1, SEQ ID NO: 6); 5'- 
In1up: 5'-ACC TCG AGA CTT GGA TGG CAC-3' (nucleotides 8324-8342 of SEQ ID 
NO: 1. SEQ ID NO: 7); 3'-ln1down: 5'-GGG OTA TAA TTG GTG CCA TCC-3' 
(nucleotides 8312-8332 of SEQ ID NO: 1, SEQ ID NO: 8); 3'-ln1up: 5'-GGA TGG 
AGA AAA TCG CCA GGC-3' (nucleotides 10637-10658 of SEQ ID NO: 1. SEQ ID 
NO: 9); IN2A: 5'-GTG TGC ATT GTT TAT GGA AGG G-3' (nucleotides 10571-10593 
of SEQ ID NO: 1 , SEQ ID NO: 10); IN2B: 5'-CAT AGA CAT AAA CAG TGG AGG C- 
3' (nucleotides 12849-12871 which is part of the cDNA sequence published by 
Millauer (1993), supra, SEQ ID NO: 11). For the subsequent experiments the vector 
indicated in Fig. 3 was used. It represents a modification of the pGL2 basic vector in 
which the corresponding Flk-1 promoter fragments were inserted into the Kpnl and 
Hindlll restriction sites of the polylinker (Fig. 2). Also the luciferase reporter gene 
was replaced by the li-galactosWase gene (Schlaeger. Proc. Natl. Acad. Sci. USA 
94 (1997). 3058-3063). For an analysis of the intron intron fragments were cloned 
into the BamHI and Sail restriction sites indicated. DNA manipulations. PCR 
amplification and DNA sequencing were performed according to conventional 
methods known in the art as described, for example in Sambrook. supra and PCR 
Technology. Griffin and Griffin, eds., RC Press London (1994). 

Example 2: Functional analysis of the Intron of the Fik-1 gene In vitro 

Figure 4 shows the result of transient transfections in BAECs. The conresponding 
intron fragments were combined with a Flk-1 promoter fragments which comprised 
nucleotides -640 to +299. The promoter activity was standardized with respect to the 
promoter activity of the construct containing the 5'-ln1 fragm nL 
Tissue culture and transient transfections w re performed as foltows: 
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All cells were cultured in DMEM+ supplemented with 10% FCS (Sigma) and as 
described in Schaeger, 1997. bEndS cells were generated by transfonnation with the 
Polyoma middle-T oncogene as described earlier (Montesano. Cell 62 (1990), 435- 
445). Bovine aortic endothelial cells (BAECs) were prepared as described 
(Schwartz, In Vitro 14 (1978), 966-980). NIH 3T3. C2C12 and L cells were obtained 
from ATCC, Transient transfections were perfonmed using the CaP04-precipitation 
method according to Chen and Olcayama (Mol. Cell Biol. 7 (1987), 2745-2752), 
optionally with modifications as described (Ronicke, 1996). The transfection 
efficiency was monitored by co-transfection of a p-galactosidase reporter vector. 
Each construct was transfected at least six times in three independent experiments. 
Cells were grown to 70% confluence in 6-cm dishes prior to transfection. Cells were 
washed 16 hrs after addition of CaPO-precipitate and incubated for further 48 hrs. In 
each experiment, 6 ^lg luciferase and 1 \ig pCMV5 (ROnicke, supra) lacZ reporter 
gene constructs were used. Cells were lysed in 1 x reporter-lysis-buffer (Promega) 
for 15 min on a test tube-rotator After centrifugation, the supematant was 
transfen-ed to a fresh tube and stored at -80 **C or taken for luciferase-and lacZ- 
assay immediately. Reporter-gene assays for p-galactosidase activity were 
performed according to Eustice (Biotechniques 11 (1991), 739-740). Chlorophenol 
red-p-D-galactopyranoside (CPRG) was used as a substrate and the conversion 
was measured at 575 nm in an ELISA-reader (Biometra). Extracts were diluted to 
obtain ODs75nm values between 0.2 and 0.8. These values were used to standardize 
for transfection efficiency after subtracting a background value, as determined from 
a cell extract of a transfection without lacZ-reporter plasmld but with a luciferase- 
reporter plasmid. Luciferase-reporter gene assays were perfontied with the same 
extracts as described by the manufacturer (Promega). Luciferase activity was 
measured with a luminometer (LB96P, Berthold) and calculated as per cent of, the 
activity of the pGL2-promoter plasmW (Promega). 
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Construct 


5-1 n1 


3"-ln1 


In2 


BAEC 


100+/-0% 


128+/-34% 


136+/-52% 


3T3 


100+/-0% 


55+/-15% 


74+/-33% .. 



Table I: Functional analysis of the intron of the Flk-1 gene. The upper line indicates 
the corresponding intron fragment which was analyzed in combination with the Flk-1 
promoter (-640 bp/+299 bp). 



Figure 4C shows the results of another transfection assay of the intron fragments. It 
was earned out as described above, with the exception that a Flk-1 promoter 
fragments was used that comprised the region between nucleotides -4100 and 
+299. Also, a fragment was analyzed that contained the entire first intron. the 
second exon, the second intron and part of the third exon shown in Figure 4A. 



Construct 


5'-ln1 


3'-ln1 


In2 


ln1+2 


BAEC 


100+/-0% 


206+/-81% 


119+/-51% 


154+7-68% 


3T3 


100+/-0% 


71+/-32% 


85+/-27% 


35+/-12% 



Table II.: Functional analysis of introns of the Flk-1 gene. The upper line indicates 
the corresponding intron fragment which was analyzed in combination with the Flk-1 
promoter (-4100 Bp/+299 Bp). 



An analysis of this experiment revealed that the construct with the 3' region of the 
firet Intron in BAECs had an activity that was twice that of that containing the 5' 
region of the first intrxin. Also, it showed 85% higher activity than the construct with 
the second intron (p=0.0153). The 4.5 kbp longer constmct ln1+2 that also 
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contained the 3" region of the first intron, too. revealed an activity that was markedly 
higher in BAECs than in 3T3 cells. 

A functional analysis of the first 6.5 kbp of the transcribed region of the murine Flk-1 
genes lead to the identification of an endothelial-specific positive regulatory element. 
This regulatory sequence is located in the region between the Xhol and BamHI 
restriction site in the first intron of the Flk-1 gene (cf. Fig. 4A). It is functional in both 
orientations since the intron is used in an antiparallel manner with respect to the Flk- 
1 promoter fragment in the construct refered to as 3'-ln 1. In construct ln1+2, 
however, the original orientation was maintained. A sequence analysis of the intron 
enhancer lead to the identification of two potential GATA binding sites at position 
+1927 bp and +3514 bp; (Evans. Proc. Natl. Acad. Scl. USA 85 (1988). 5976-5980; 
Orkin. Blood 80 (1992). 575-581). a potential AP-1 binding site at position +2210 bp; 
(Lee, Cell 49 (1987). 741-752) and two PEA3 consensus sequences at position 
+3494 bp and +3741 bp; (Martin. Proc. Natl. Acad. Sci. USA 85 (1988). 5839-5843). 

Example 3: Functional characterization of the Flk-1 promoter In vivo. 

So far, analyses of the murine Flk-1 promoter have been restricted to in vitro 
systems (ROnicke. supra; Patterson, supra). The investigation of the promoter 
activity in vitro is an important tool in promoter characterizing since it is useful to 
assay a large number of promoter constaicls for their activity in a short time. 
However, this situation is always an artificial one since not all factors that are 
relevant in vivo can also be reconstituted in vitro. While an in vitro investigation of a 
promoter yields important infomration on the mechanisms of gene regulation it is 
only the in vivo characterization that can yield the final proof for the relevance of the 
elements identified. An excellent test system for promoter analysis in vivo are 
transgenic mice. In this model the con-esponding promoter fragment was cloned 
before a reporter gene, isolated together with this reporter gene and injected into 
fertilized mouse oocytes. In many cases, successful integration of the promoter 
reporter constmct into the mouse genome lead to transgenic mice which contain the 
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construct in every cell. This test system, in addition to the analysis of the promoter 
activity during embryonic development and in the adult animal, allows a tissue- 
specific characterization of the promoter activity. 

For the investigation of the Flk-1 promoter in transgenic mice the bacterial li- 
galactosidase reporter gene was chosen since the gene product is easily detectable 
by color reaction and remains at the location of production due to its limited 
solubility. In this manner it is possible to Identify cells in which the con-esponding Flk- 
1 promoter fragment since only there an expression of li-galactosidase took place. 
When producing transgenic mice it was taken care that no regions originating from 
the vector were injected along with the promoter. First. Flk-1 promoter fragments 
comprising the regions between nucleotkles -640 and +299. -1900 and +299 as well 
as -4100 and +299 were investigated. The constructs were based on plasmid pGL- 
2B described in Figure 2 with the exception that the luciferase reporter gene was 
replaced by the B-galactosidase gene. All injection fragments used in the examples 
were obtained by restriction digestion with the enzymes Kpnl and Sail. Transgenic 
mice were generated as described by (Hogan. Manipulating the Mouse Embryo 
(1994), Cold Spring Harbor Laboratory Press. New York). Fertilized oocytes were 
isolated from superovulated C57BU6 x C3H/He F1 mice, microinjected and 
reimplanted into pseudopregnant females of the same hybrid-mouse strain. Mice 
were sacrifized at day 10.5 or 11.5 of gestation, and embryos were analyzed by 
whole mount LacZ staining for transgene expression. The embryos to be examined 
weiB isolated on day 10 after reimplantation of the injected oocytes. Analysis of the 
transgenic embryos revealed that although promoter activity could be detected, 
none of the constructs was capable of conferring reproducible expression of the 
reporter gene in the endothelium. 

Example 4: Functional characterization of the Flk-1 intron in vivo 



After analysis of the Flk-1 promoter region from -4.1 kbp to +299 Bp the intron which 
was identified in vitro was then examined for its function in vivo. For this purpose, a 
construct similar to that shown in Figure 3 was used which contained an Flk-1 
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promoter fragment ranging from nucleotide -4100 to base pair +299 and the intron 
enhancer (3'-ln1, cf. Fig. 4A). The staining and fixation of the embryos was 
performed as follows: The mid-day of the plug observation was counted as E0.5. 
The embryos were dissected out in ice-cold PBS and fixed in ice-cold 2 % (w/v) 
paraformaledehyd. 2 mM MgCi^, 2 mM EGTA. 0.1 M Pipes buffer. pH 6.9 for15 to 
120 minutes. The embryos were rinsed with PBS three times for 5 minutes each. 
The LacZ expression was detected by incubating the embryos at 30 ovemight in 
0.1 % X-gal (S-bromo^-chloro-a-indolyl-p-D-galactopyranoside). 5 mM potassium 
ferricyanide. 5 mM potassium fenxjcyanide 5 mM. 1 to 2 mM magnesium chloride. 
0.002 to 0.02 % NP-40, 0.01 % or 0.25 mM sodium deoxycholate. PBS, pH 7.0. 
After the staining, embryos were rinsed in PBS and postfixed at 4 'C ovemight in 2 
% paraformaldehyde. 0.1 % glutaraldehyde. PBS. pH 7.0. For whole-mount 
photography, the postfixed embryos were rinsed in PBS and equilibrated for 3 
minutes each in (optionally 30 %) 50 % glycerol and then in 70 % glycerol. Figure 5 
shows an embryo (embryonic day 10.5) which was isolated after injection of the 
fragment. In Figure 5A a color reaction in vessels of the developing brain can be 
clearly discerned. Also superficial vessels in the body's middle (dorsal) and a 
staining in the liver bud can be observed. Figure SB shows the dorsal, caudal region 
of the same embryo. It proves that the vessels in both halves of the head were 
stained. 

For an exact localization of the stained cells the embryo was embedded in paraffin, 
cut into slices of 10 ^lm and counterstained with neutral red. Cryostat sectioning and 
lacZ staining of organs from postnatal mice was performed as described (Schlaeger. 
1997). 

The results of the histological analysis are shown In Figure 6. It shows pseudo 
transversal cuts of the embryo. In section 6A a staining of the inner lining of the V. 
cardinalis anterior (3) and of other superficial vessels can be seen. Figure 6B 
represents a strong magnification of a section of 6A with the staining of endothelial 
cells within the V. cardinalis anterior. Figure 6C shows a more caudally located 
region. Again, the staining of the V. cardinalis anterior and of superficial vessels with 
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wide lumen and thin walls as well as of vascular structures in the neural tube is 
clearly visible. Also, a staining of the chorda dorsalis (9) can be observed. However, 
in none of the cases a staining of arterial vessels could be observed. 
The subsequent injection of the same fragment lead to a total of eight further 
transgenic embryos which displayed an identical expression pattern albeit in two 
cases of weaker nature. Thus, the intron enhancer exhibited in vivo an effect that 
was even more marked than in vitro, in combination with a promoter fragment which 
on its own had a very variable expression pattern it ensures a reproducible 
expression pattern with clear endothelium specificity, however, covering a 
substantial part of the endogenous Flk-1 expression pattern. 

Example 5: Functional analysis of the introns I and II of the murine Flk-1 gene 
in vivo 

Since the intron enhancer in combination with an Flk-1 pronwter fragment displayed 
an endothelium-specific function in transgenic mice covering a substantial part of the 
endogenous expression pattern, the further search for in vivo relevant, gene 
regulatory elements was extended to other intron regions. For this purpose, the 
construct containing the promoter region between nucleotides -4100 to +299 and the 
first 6.5 kbp of the transcribed region (ln1+2; cf. Fig. 4) was used. Figure 7 shows an 
embryo on embryonic day 10.5 which was obtained after injection of this fragment. 
Again, a staining of the vessels in the developing brain as well as superficial vessels 
the of the liver bud was visible. The following injections yielded four further 
transgenic embryos which displayed the same pattern. A combination of the 
promoter region used with only the 5' end of the first intron (5'-ln1; cf. Fig. 4). 
however, yielded no endothelium-specific expression pattern. 
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Examples: Combination of the intron enhancer with the Flk-1 promoter 
fragment that was the most potent in vitro 

To investigate whether the repressing elements of the murine Flk-1 promoter 
between nucleotides -4100 and -640 are functional also in combination with the 
intron enhancer, a shorter construct without these inhibitory regions was used for 
further analysis. It contain the intron enhancer (3'-ln1) and the region from base 
pair -640 to nucleotide +299. This region displayed the highest activity in vitro. 
Figure 8 shows three transgenic embryos (embryonic day 10.5) which were obtained 
after injection of the fragment. All three display a more marked staining in vascular 
structures than the embryos analyzed so far. While the embryo on the right hand 
shows a weak staining, the left-hand embryo yields a very strong expression in 
virtually all vessels. The embryo in the middle holds a medium position as regards 
the completeness of its expression pattern, i.e., it lacks expression in the heart 
although it resembles strongly the embryo on the left hand as regards the staining of 
the other structures. In Figure 9A the left-hand embryo from Fig. 8 is shown in more 
detail. The strong staining of the heart in the region of the atrium and ventricle is 
particulariy clearly visible. Furthermore, the vessels of the developing brain, the 
vessels between the somites, the aorta dorsalis as well as the fine capillary plexus 
on the body's surface are stained. Figure 9B shows a sectional magnification of 9A. 
Here, the staining of the vessels in the head region as well as the expression in the 
superficial capillary plexus is visible. In Figure 9C the same embryo is shown from 
the other side. In addition to the structures described in Figure 9 A also a staining of 
the chorda dorsalis can be observed. 

The embryo shown in Figure 9 was embedded in paraffin and cut to slices. The cuts 
dyed with neutral red are shown in Fig. 10. Figure 10A shows a pseudo transversal 
cut through the head region. Particularly prominent is the branching of the A. carotis 
interna (4) in addition to the staining of other vascular stmctures. Figure 10B 
represents a magnification of a similar cut; here, too, the branching of the A. carotis 
interna is particulariy striking. Figure IOC shows a mor caudal cut which In terms of 
its position roughly corresponds to the cut shown in Figure 6A. Here, however, in 
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addition to the staining of tiie V. cardinalis anterior (6) an expression in the 
branching of the A. carotis interna (4) and other vascular structures is visible. Figure 
10D represents an even more caudally located region. A staining in the venous 
endothelium (V. cardinalis posterior. 9) and in the arterial stmctures {aorta dorsalis, 
10) can be observed. Furthemiorerthe endocardium of the atrium as well as the 
vessels in the trabeculae of the heart ventricles display an expression. 
A total of seven transgenic embryos was analyzed after injection of this fragment (- 
640 bp/+299 bp/3'-ln1). Safe for one which showed no staining, all embryos 
displayed an expression of the li-galactosidase in endothelial stmctures. The 
staining was regulariy more marked than in combination with the negative regulatory 
elements between nucleotides -4100 and -640. Thus, the in vitro identified regions 
displayed a function in vivo. The deletion of these negative regulatory elements 
yielded a constmct that lead to a reproducible expression in venous and arterial 
endothelium. 

Example?: Endothelium^pecinc expression mediated by Flk-I regulatory 
sequences in vivo 

When the Flk-1 promoter fragment with the strongest in vitro activity (-640 bp/+299 
bp; RSnicke. supra) was tested in combination with the 2.3 kbp Xhol/BamHI 
fragment of the firet intron that showed endothelium-specific activity in vitro (3-ln1; 
+1677 bp/+3947 bp; see Example 6), a reproducible vascular lacZ expression in 
transgenic El 0.5 mouse embryos derived from foster mothers was observed (Table 
III). In these embryos, the lacZ reporter gene was expressed in developing vascular 
structures, such as capillaries in the head region, intersomitic vessels, the dorsal 
aorta, and in the heart aniage (Fig. 11 A). Sectioning of these embryos confimied 
that the p-galactosidase protein was confined to vascular endothelium. This in vivo 
analysis demonstrated that the intron sequences in combination with the Flk-1 
promoter confer an endothelium-specific expression pattern that closely resembles 
the expression pattern of the endogenous Flk-1 gene (Mlllauer, 1993). Moreover, the 
intron fragment could also direct endothelial cell-specific lacZ expression wh n used 
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in an inverted orientation in the reporter construct (Construct -640 bp/+299 
bp//+3947 bp/+1677 bp; see Table III). 

Table III. Summary of the in vivo activity of different flk-1 constructs 



Construct 


TG 


ES 


ET 


NO 


^100/+299 


11 


0 


3 


8 


.1900/+299 


31 


1 


10 


20 


-640/+299 


3 


0 


1 


2 


^0/+299 // SIntron +1677/+3947 


7 


6 


0 


1 


^40/+299 // 3'lntron +3947/+1677 


4 


3 


1 


0 


-640/+299//3'lntron +3437/+3947 


7 


5 


0 


2 


-640/+254 // SIntron +1677/+3947 


12 


8 


1 


3 


tk // 3^lntron +1 6777+3947 


15 


3 


0 


12 


-5500/+299 // Intron l+ll 


3 


2 


0 


1 



Embryos transgenic for the constmcts given above were generated, and LacZ 
staining and genotyping was performed at E10.5 or E11.5 as described in Example 
4. Constmcts are defined by the position of the promoter or intron fragments in bp 
relative to the transcription initiation site of the endogenous Fll<-1 gene. TG. number 
of transgenic embryos; ES. number of embryos showing endothelial-specific 
staining; ET, number of embryos showing ectopic staining; NO, number of embryos 
showing no staining at all. 

Transgenic mouse lines were generated with this reporter gene constmct (-640 
bp/+299 bp//+1677bp/+3947bp) containing the Flk-I regulatory s quences. One of 
these lines (2603) showed a complete vascular expression of the reporter gene at 
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E11.5 and was analyzed further (Fig. 11B). Sectioning of p-galactosidase stained 
E11.5 transgenic embryos revealed that reporter gene expression was confined to 
the endothelium of blood vessels, e.g. in the endothelium of the dorsal aorta (Fig. 
11D), in venous vessels (Fig. 11E) and in the perineural vascular plexus and 
sprouting capillaries invading the neural tube (Fig. 11F). To detennine whether 
transgene expression in this mouse line reproduced the complete expression pattern 
of the endogenous Flk-1 gene, the lacZ staining pattern of these embryos was 
compared to heterozygous Flk-1 mutant mouse embryos which express the /acZ 
gene from the endogenous Flk-1 locus (Shalaby. Nature 376 (1995), 62-66). In 
these knock-in mice, the /acZ gene was inserted into the endogenous Flk-1 locus via 
homologous recombination and is therefore expected to reproduce the expression 
pattern of the endogenous Flk-1 gene (Shalaby. 1995). The lacZ staining pattern of 
transgenic embryos and the knock-in embryos at El 1.5 was indistinguishable (Fig. 
11B,C). It is concluded from these data that the -640 bp/+299 bp promoter region of 
the Flk-1 gene and the 2.3kbp Xhol/BamHI fragment of the first intron contain 
regulatory elements that are sufficient for endothelial-specific gene expression in 
developing mouse embryos. 

Examples: The first intron of the Flk-1 gene contains an autonomous 
endothelium -specific enhancer 

To assess the role of the 2.3 kbp Xhol/BamHi fragment of the first Flk-1 intron in 
endothelium-specific gene expression, it was further investigated whether the intron 
sequences can confer endothelium-specific expression to the heterologous herpes 
simplex vinis-thymidine kinase {tk) promoter. This promoter has no intrinsic 
endothelial cell specificity (Schlaeger. 1997). A /acZ reporter gene construct was 
generated that contained the tk promoter, in combination with the 2.3 kbp 
BamHl/Xhol fragment of the first intron (+1677 bp/+3947 bp). The tk promoter 
sequences were amplified from the plasmid ptkSDKIacZ (Schlaeger, 1997) using 
oligonucleotides tk5' (5'-CCGGTACCCAAACCCCGCCCAGCGTCTTG-3'; SEQ ID 
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NO: 16) and tk3' (5'-CCGACAAGCTTGGTCGCTCGGTGTTCGAGG-3'; SEQ ID NO: 
17). The PGR product was digested with Kpnl and Hindlll. 

From the p-galactosidase reporter constmct described in Example 2 (Fig. 2), Flk-1 
promoter sequences were exised and removed by Kpnl and Hindlll digestion. The tk 
promoter was then subcloned in the1<pnl and Hindlll restriction sites of the vector. 
Transgenic mouse embryos generated with this construct showed vascular reporter 
gene expression {Fig. 11G). The p-galactosidase staining observed in these 
embryos was weaker than in embryos expressing /acZ under the control of the -640 
bp/+299 bp Flk'1 promoter in combination with the intron fragment {Fig. 11A,B). 
Also, the frequency of transgenic mouse embryos expressing this transgene was 
significantly reduced, when compared with constructs driven by the -640 bp/+299 bp 
Flk-1 promoter in combination with the intron fragment (Table III). This indicates that 
the tk promoter lacks positive acting elements which are present within the Flk-1 
promoter. However, these results show that the Flk-I intron fragment alone, in 
contrast to the Flk-1 promoter, can repn^ducibly target reporter gene expression to 
the endothelium. Taken together, the results of both the in vitro an in vivo 
experiments in this study demonstrate that sequences located in the first intron of 
the mouse Flk-1 gene act as an autonomous endothelium-specific enhancer. 
In order to further characterize the minimal intron sequences that are required for 
endothelium-specific expression, we analyzed whether shorter intron fragments 
were also active in combination with the 939 bp promoter region of the Flk-I gene (- 
640 bp/+299 bp). By this deletion analysis, the intron enhancer was localized to a 
510 bp Swal/BamHI fragment (+3437 bp/+3947 bp) which is located immediately 
upstream of the second exon. This fragment was sufficient to stimulate endothelium- 
specific /acZ expression in transgenic mouse embryos (Fig. 11H, Table III). The DNA 
sequence of this fragment (Fig, 12) contains potential binding sites for the GATA 
and Ets transcription factors, and for SclATaM, all of which have been implicated to 
play a role in angiogenesis (reviewed by Risau. Nature 386 (1997). 671-674), 
Whether these consensus sequences represent functional transcription factor 
binding sites remains to be d t rmined. 
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Example 9: Flk-1 regulatory sequences target endothellum-specific transgene 
expression throughout development 

To test whether the regulatory sequences of the Flk-1 promoter and enhancer 
identified can reproduce the endogenous Flk-1 expression pattern throughout 
development, the /acZ expression pattern of the transgenic mouse line 2603 (Fig. 
11B) was further analyzed at various stages of embryonic development and at 
postnatal days 5 (P5) and 120 (P120). In this mouse line, the transcription of lacZ is 
driven in combination by the -640 bp/+299 bp Flk-1 promoter and the 2,3 kbp 
BamHI/Xhol intron enhancer fragment. The eariiest stage during which transgene 
expression was detectable by whole mount LacZ staining was in E7.8 embryos (Fig. 
13A). This is the earliest stage that was examined. The analysis of sections of these 
embryos confirmed that the transgene was expressed in angioblasts of the allantois 
and the yolk sac (Fig. 13B.C). Moreover, transgene expression was restricted to the 
vascular endothelium at all stages of embryonic development examined. To 
determine if the transgene expression persists after birth, we performed lacZ 
staining of cryostat sections from several different organs of P5 and P120 transgenic 
mice. Strong LacZ staining was detected in vessels of the spleen, kidney, thymus, 
liver and lung from P5 animals (Fig. 13D-H). However, lacZ expression was 
downregulated in most vascular beds of PI 20 animals, as is the case for the 
endogenous Flk-1 (Millauer. 1993; Kremer, Cancer Res. 57 (1997), 3852^859). 
Taken together, these results support the conclusion that the identified Flk-I 
regulatory sequences (the 939 bp pronwter in combination with the intron enhancer) 
are sufficient to reproduce most, if not all, properties of the endogenous Flk-I 
expression. 

Example 10:The 5'-UTR of the Flk-1 gene is required for expression of Flk-1 in 
the yolk sac vasculature 

In Flk-lllacZ knock-in embryos, the lacZ gene is under contnDl of all endogenous 
regulatory elements except for the regions from bp +137 to bp +299 in the 5-UTR 
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and approximately the first 600 bp of the first intron (Shalaby. 1995). It has been 
shown in accordance with the present invention that the intron sequences deleted in 
the knock-in construct created by Shalaby (1995) are not required to generate the 
strong and complete endothelial-specrfic reporter gene expression which is mediated 
by the Flk-I regulatory sequences described in this study (Fig. 1 1B and Table III). 
However, since the complete Flk-I 5*-UTR is present in the reporter gene construct 
which directs the most complete vascular-specific /acZ expression (-640 bp/+299 
bp//+1677 bp/+3947 bp; Fig. 1 1B and Table III), it allows to study the consequences 
of a partial 5'-UTR deletion on Flk-1 expression in vivo: Genomic DNA was prepared 
from unstained embryos or yolk sacs. Genotyping was performed by PGR analysis 
using the primer pairs -258fw/LacRev or LacZP1/LacZP2. Primers for PGR analysis 
were: -258fw: 5'-ATGGTAGGGAGGTTGCTGGGGGCAG-3' (SEQ ID NO: 12); 
LacRev: 5'-TGGTGCCGGAAACCAGGGAAA-3^ (SEQ ID NO: 13); LacZPI: 5'- 
ATCCTCTGCATGGTCAGGTC^' (SEQ ID NO: 14); LacZP2: 5'- 
CGTGGCGTGATTCATTCC-3' (SEQ ID NO: 15). The complete vascular staining of 
the Flk-1 lladZ knock-in embryos at E11,5 indicates that the 5'-UTR is not essential 
for vascular expression in the embryo proper. However, the yolk sac staining pattern 
of Flk-1 llacZ knock-in embryos and of transgenic mice from this study that harbored 
constructs containing the complete 5 -UTR were markedly different (Fig. 14A-C). The 
uniform vascular lacL expression in the transgenic yolk sacs from this study (Fig. 
14A, B) was absent in small vessels of the yolk sacs of the knock-In embryos (Fig. 
14C), in which only large yolk sac vessels were stained. In addition, it was found that 
replacement of the entire Flk-1 promoter including the 5 -UTR by the tk promoter in 
the present transgenic construct (Table III) leads to a similar /acZ expression pattern 
in the yolk sacs as that described in the yolk sacs of the Flk-HlacZ knock-in 
embryos. Thus, the 5 -UTR might be involved in specifying Flk-I expression in a 
subset of endothelial cells. 
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Example 11:The role of HIF-2a in Flk-1 regulation 

The Flk-1 promoter (-640 bp/+299 bp) confers endothelium-specific expression to 
the firefly luciferase reporter gene in transfected bovine aortic endothelial cells 
(R5nicke, 1996) and provides for a~strong reporter gene transcription in vivo; see 
Examples 6 to 10. This suggests that transcription factors that are specifically 
expressed in endothelial cells activate the Flk-1 promoter in a cell-type specific 
manner. 

The basic helix-loop-helix PAS-domain transcription factor. HIF-2a {also known as 
HLF, HRF or EPAS1), is expressed in endothelial cells during mouse embryonic 
development (Ema. Proc. Natl. Acad. Sci. USA 94. 4273-4278, 1997; Flamme, 
Mech. Dev. 63 (1997). 51-60; Tian. Genes Dev. 11 (1997). 72-82) and is thus a 
candidate to regulate Flk-1 expression. HIF-2a has previously been shown to 
stimulate both the expression of VEGF (Ema. 1997) and 77e2 (Tian. 1997). To 
detennine if HIF-2a might be involved in the regulation of Flk-1 gene expression. 
A293 cells were co-transfected with a luciferase reporter gene construct containing 
Flk-1 promoter sequences (-640 bp to +299 bp) and an eukaryotic expression vector 
that contained the mouse HIF-2a cDNA. Mouse HIF-2a and HIF-1a cDNAs were 
obtained from a mouse brain capillary endothelial cell cDNA library (Schnurch. 
Development 119 (1993). 957-968) with a 300 bp BamHI/Ncol fragment spanning 
the 5'UTR of HIF-1a. Positive phages were rescreened, and inserte were amplified 
by PGR using oligonucleotides HIF Start: (5'-GGGAATTCACCATG 
AGTTCTGAACGTCGAAAAG-3'; SEQ ID NO: 18) and HIF Flag Stop: (5'- 
AAGCGGCCGCTCATTTATCGTCATCGTCCTTGTAATCGTTAACTTGATCCAAAG 

CTCTG-3'; SEQ ID NO: 19). The PGR product was digested with Eco Rl and NotI 
and subcloned in the EcoRI and NotI restriction sites of pcDNA3 expression vector. 
The murine HIF-2a cDNA was obtained as described (Flamme. 1997). The phage 
insert was amplified by PGR using oligonucleotides HRF START (5- 
GGGAATTCAGGACAATGACAGCTGACAAGGAG-3': SEQ ID NO: 20) HRF rev (5'- 
AAGCGGCCGCTGATTTATCGTCATCGTGCTTGTAATGGTTGGTGGCGTGGTCCA 
GAGCTCTGAG-3'; SEQ ID NO: 21) and PGR product was digested with EcoRI/NotI 
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and cloned into the EcoRI and NotI sites of pcDNA3. The sequence encoding the 
FLAG epitope was included in the reverse oligonucleotide primer. HIF-2a and HIF- 
1a expression plasmids were constructed by inserting the FLAG-tagged cDNAs into 
the EcoRI and NotI sites of pcDNA3 (Invitrogen). For co-transfection assays, A293 
cells were split 1:2 into 35 mm dishes and transfected 18 hours later with 4 ng of 
DNA (2 \iQ of Flk-1 pronwter-driven luciferase plasmid. 1 ng of CMV promoter-driven 

galactosidase expression plasmid. and 1 of the HIF-2a or HIF-1a expression 
plasmids. or pBluescript SKII and pcDNAS as a control) using a transfection kit 
(MBS, Stratagene). After 20 hours, reporter gene activity measurements were 
performed using the Dual Light Kit (Tropix, Bedford, MA). The luciferase activity of 
each extract was normalized to the respective p-galactosidase activity. Endogenous 
background levels of both enzyme activities were measured using extracts from 
mock-transfected cells and were substracted. The nomialized luciferase activity of 
the control transfection was arbitrarily set to 1. Each value represents the average of 
at least six experiments. 

In comparison to cells transfected with the luciferase reporter constmct alone, co- 
transfection of the HIF-2a constnjct increased reporter gene activity approximately 
15-fold (Fig. 15). In contrast, HIF-1a , a close relative of HlF-2 which stimulates the 
hypoxia-induced transcriptton of the VEGF gene, failed to stimulate the reporter 
construct (Fig. 15). These results suggest that HIF-2a regulates the expression of 
the Flk-1 gene. 

Summary 

The mouse Flk-1 receptor is crucial for the differentiation of the hemangioblastic 
lineage and during embryonic vascular development (Risau. Annu. Rev. Cell Dev. 
Biol. 11 (1995). 73-91; Shalaby. 1995; Risau, 1997). Moreover. Flk-1 plays a central 
role in the regulation of neovascularization in a wide variety of tumors (Plate, Brain 
Pathol. 4 (1994) , 207-218; Ferrara. 1996). To elucWate the basis of its endothelial 
expression, regulatory sequences of the murine Rk-I gene have been isolated and 
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Characterized that confer endothelium-specific reporter gene expression in 
transgenic mouse embryos. Transgene expression driven by these sequences was 
strong, specific, and highly reproducible. Most importantly, it has been demonstrated 
that the isolated sequences were active in early stage vascular development and 
may thus represent a clue towards'tiie identification of the molecular mechanisms 
involved in hemangioblast differentiation and vasculogenesis. Moreover, transgene 
expression persists until shortly after birth and is downregulated in adult animals, as 
it v/as described for the endogenous Flk-1 gene (Miliauer, 1993; Kremer, 1997). 
Endothelium-specific expression in almost all transgenic mouse embryos tested was 
mediated by a 939 bp fragment of the promoter region in combination with a 
fragment of the first intron. 5'-flanking fragments up to -5.5 kbp alone were not 
sufficient to confer a reproducible endotheliunvspecific transgene expression. 
Reproducible endothelium-specific expression was therefore dependent on 
sequences from the first intron. These sequences also activated the heterologous tk 
promoter specifically in endothelial cells in vivo, and were active in an orientation 
independent manner. Thus, they fulfill the criteria for an autonomous tissue specific 
enhancer. 

As demonstrated in Example 8. the intron sequences that were sufficient for 
endothelium-specific expression were contained in a 510 bp fragment. Several 
potential binding sites for known transcription factors could be identified therein (see 
Figure 12), including consensus binding sites for c-etsi, PEA3 (an Ets-like 
transcription factor), GATA transcription factors, and SclH-al-l. The c-etsI 
transcription factor was proposed to be involved in the eariy differentiation of 
endothelial cells from their precursors (Pardanaut, Cell • Adhesion and 
Communication 1 (1993). 151-160). In additton. c-etsI is expressed in endothelial 
cells during tumor vascularization and other forms of angiogenesis in humans 
(Wemert. Am. J. Pathol. 140 (1992). 119-127). Proteins of the Ets family can 
activate transcription through a PEA3 motif (Wemert. 1992). Transcription factors of 
the GATA family are involved in the transcription of genes that are expressed in the 
hematopoietic and ndothelial lineages, such as von Willabrand factor (Jahroudi. 
Mol. C II. Biol. 14 (1994). 999-1008). Unlike the hematopoietic-transcription factor 
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GATA-1 , GATA-2 is expressed in both the endothelial and hematopoietic lineages 
(Elefanty. Blood 90 (1997), 1435-1447). Scl/Tal-1 has recently been implicated in the 
regulation of Ftk-1 expression in Zebrafish (Liao. Genes Dev. 12 (1998). 621-626). 
The presence of two potential Sdn"al-1 binding sites in the murine Flk-1 intron 
enhancer suggests that Scin"al-1 might regulate Flk-1 expression in mice. However, 
no direct effect of Scl/Tal-1 on Flk-1 expression has been observed so far in mice, 
although Scl-null mice have vascular defects (Visvader. Genes Dev. 12 (1998). 473- 
479). 

Recently, analyses of the regulatory elements of other endothelium-specific genes 
such as von Willebmnd factor (Aird, Proc. Natl. Acad Sci .USA. 92 (1995), 4567- 
4571), c-ets-1 (Jorcyk, Cell. Mol. Biol (Noisy-le-grand) 43 (1997), 211-225) or the 
endothelial receptors. 77ef (Korhonen, Blood 86 (1995). 1828-1835) and 77e2 
(Schlaeger, Development. 121 (1995), 1089-1098; Schlaeger. 1997) have been 
reported. The most uniform expression pattern reported was conferred by regulatory 
elements of the 77e2 gene. However, in contrast to Flk-1, expression of 77e2 and of 
reporter genes driven by 77e2 regulatory sequences is not downregulated in adult 
animals. Such as in the Flk-1 gene, the first introns of the 77e2 gene and of the Ets-1 
gene are involved in endotheliunrv-specific expression. Similar to the Flk-1 intron 
enhancer, the first intran of the Tie2 gene also contains an autonomous endothelial 
specific enhancer. A major difference bebNeen the stmctural organisation of the 
regulatory elements of the Flk-1 gene and the 7ie2 gene is, however, that the 77e2 
promoter by Itself is active in certain embryonic blood vessels (Schlaeger, 1995). At 
least during the developmental stages analyzed (i.e. E10.5 and El 1.5) an 
autonomous function of the Flk-1 promoter was not observed. The intronic 303 bp 
Tie2 core enhancer also contains potential binding sites for transcription factors of 
the Ets and GATA families (Schlaeger, 1997). and o-etsi or PEA3 binding sites are 
present In the promoters of Tie1. 77e2 and Flt-I (Kortionen. 1995; Schlaeger. 1995; 
Wakiya, J. Biol. Chem. 271 (1996), 30823-30828). 

Analysis of Flk-1 HacZ knock-In mouse embryos that express the /acZ gene from the 
ndogenous Flk-1 locus has previously shown that the fecZ reporter gene is 
expressed ubiquitously in the developing intra-embryonic vasculature and the yolk 
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sac of E7.5 embryos (Shalaby. 1995). However, in accordance with the present 
invention it was found that a fragment of the 5' UTR that is deleted in the l<nocl^-in 
construct is required for reporter gene expression in the yoli< sac vasculature during 
later stages of embryonic development. Based upon transient transfection analyses 
in bovine aortic endothelial cells. fRe Flk-1 5-UTR has been shown to contain a 
positive acting, endothelial cell-specific element between nucleotides +136 and +299 
(Renicke. 1996). The complete vascular staining of the Flk-1llacZ knock-in embryo 
proper at Ell. 5 demonstrates that the 5-UTR is not essential for intraembryonic 
vascular expression at this developmental stage. 

The involvement of HIF-2a in the regulation of Flk-1 expression further emphasizes 
the role of basic helix-loop-helix/PAS-domain transcription factors in the regulation of 
components of the VEGF signal transduction system and of vascular development. 
The upregulation of VEGF in response to hypoxia is generally thought to be 
mediated by HIF-1. Moreover, mouse embryos lacking functional genes for HIF-1a 
or ARNT show defects in vascular development, perhaps due to reduced VEGF 
levels (Maltepe. Nature 386 (1997). 403-407; Iyer, Genes Dev. 12 (1998), 149-162). 
This observation indicates that the physiological relevance of these transcription 
factors is not restricted to adaptation to hypoxia, but extends to the regulation of 
nomial vascular development. HIF-2a is expressed in various tissues, including the 
developing endothelium of several organs, for example in the brain (Flamme, 1997). 
It seems therefore likely that HIF-2a is involved in the regulation oi-Flk-1 expression 
in blood vessels that co-express H/F-2a and Flk-1. Interestingly. H/F-2a is also 
expressed in tissues that express the Flk-1 receptor ligand, VEGF, and has been 
shown to stimulate VEGF expression (Ema, 1997). Taken together, these 
observations support the hypothesis that HIF-2a is both an intrinsic and extrinsic 
regulator of blood vessel growth and function (Flamme. 1997). by stimulating both 
receptor and ligand expresston. The expression of VEGF and Flk-1 shovws a 
remarkable coordinate temporal pattern both in development and in tumors. For 
example, VEGF and Flk-1 are expressed transiently in the developing mouse brain, 
and are largely down-regulated in the adult, but reactivated in brain tunwrs (Plate. 
1994). In hemangloblastomas of the brain, which are highly vascularized tumors, 
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both the VEGF and Flk-1 expression are highly up-regulated, and this correlates with 
the up-regulation of H/F-2a expression in the stromal cells of this tumor type. 
Whether HIF-2a contributes to the remarkably coordinated expression of VEGF and 
Fik-1 in other tumor types, remains to be established, since for example in 
glioblastomas - another cerebral tumor - up-regulation of VEGF is due to hypoxia, 
and HIF-2a is inducible by hypoxia. Unlike the expression of VEGF and F//-t F\k-1 
expression is not directly stimulated by hypoxia (Gerben J. Biol. Chem. 272 (1997). 
23659-23; Kremer, 1997). Thus, the primary function of HIF-2 in the regulation of 
Flk'1 expression does not appear to be related to the hypoxia response. 
Among the endothelial RTK identified thus far, Flk-1 is the only receptor whose 
function is required for the detennination of the endothelial lineage. Therefore, the 
Flk'1 gene represents the ideal candidate for studying the transcriptional regulatory 
mechanisms that are active during the emergence of the endothelial lineage. The 
observation that the isolated regulatory elements of the Flk-I gene are active in earty 
stage vascular development are of great importance for this objective. Knowledge of 
the Flk'1 gene regulatory sequences is also of great potential relevance in the 
therapy of certain diseases. The Flk-1 receptor has been demonstrated to be a key 
regulator of angiogenesis in various diseases, including cancer (Plate. 1994). 
Therefore, the study of the regulatory elements involved in the upregulation of Flk-1 
expression in the tumor endothelium appears to be particulariy relevant for studying 
the mechanisms of tumor angiogenesis. Further studies will unravel whether the 
same regulatory elements of the Flk-1 gene that confer endothelium-specific 
expression in mouse embryos are also active in the tumor vasculature. Flk-I gene 
regulatory elements active in the tumor vasculature may provide infomiation about 
the signaling pathways that can be targeted for anti-angiogenic tumor therapy. 
Finally, the Flk-I gene regulatory elements will be useful for targeting expression of 
genes to the vasculature. An attractive possibility is the expression of suicide genes 
(Ozaki, Hum, Gene Then 7 (1996). 1483-1490) under the control of these elements. 
The use of the Flk-1 gene regulatory elements in combination with, e.g., the 
Cre/loxP system may provide a powerful tool for specifically inactivating genes in the 
developing vasculature or in tumor endothelium. 
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The present invention is not to be linriited in scope by its specific embodiments 
described which are intended as single illustrations of individual aspects of the 
invention and any DNA molecules, or vectors which are functionally equivalent are 
within the scope of the invention. Iiiaeed, various modifications of the invention in 
addition to those shown and described therein will become apparent to those skilled 
in the art from the foregoing description and accompanying drawings. Said 
modifications intended to fall within the scope of the appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Max-Planck-Gesellschaf t zur Foerderung der 

Wisaenschaften e. V. 

(B) STREET: none 

(C) CITY: Berlin 

(D) STATE: none 

(E) COUNTRY: Germany 

(F) POSTAL CODE (ZIP) : none 

(ii) TITLE OF INVENTION: Regulatory Sequences Capable Of Conferring 
Expression Of A Heterologous DNA Sequence In Endothelial 
Cells In Vivo And Uses Thereof 

(iii) NUMBER OF SEQUENCES: 21 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1,0, Version ftl.30 (EPO) 



<2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12845 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
TCTAGAATAT AGAAGATAAG TTTGCGTACA ATTCAGTCCT TTGAAGACCT GATAAGCTTT 
AAGAAGGAAG ATGGGTTACA CATTGGGAAA TGGTTGCAAT CTGCACATGG CAGAGGCAAG 
AGATGCAAAT CACIATTTCTT ACATACTCCA TACAAATCTT ACAAGACTGT TTTTCTTTCT 
CATTTAAAAT AAGAAGACCT GCCAGTCTTC CCdTATTAC TAATTACAGT CACTCTGTAT 
CTTTGTTGAC ATTGGATAGT TTTACATACT TCAACAGGCT GGTGTCATTA AAGTTGTGGT 
GGGTGGGCAC CAGAGACACG TGATTCAGAG TGGGAGGAGA TGCAGGAGAA ACGAGGCACA 
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GCAGAAGCAG AAGCGAGGAA AAACACTCTC AACGTTACTA ACACATCGAG 
CACTAGCAAT ACGGGCTGAA TCTGACCTAA TCTCTGCTGT TGAAAATTTT 
ACACTAGCAA TACGGGCTGA ATCTGACCTA ATCTCTGCTG TTGAAAATTT 
GTCACACAAG TGCTGAGCAT ACAGAAAAAG GAGAGTAATT CTCTGGTTCT 
AAATAGTCTA TATCAAATTG CCTAAGATAA TGTATACATT TAGTACATGA 
CTATTCTATA TGACTATTAT TTAAATGTGA ATTTACAAGT GAGCATATGA 
ACATGGCTAG TACATATAAC TTTTAAAAAG TTGGACATAG rTATATTTTT 
ATTTACTTTA TATCCTGATC ACAGACCCCC CCCTCCTCTG GATTAACTCT 
TCTTACCCCT CCCCATCTCT CCTTCACCTC TGAGAAGGGG GGATACCTCC 
GGTTTCAGTG GGAGAAGGAT GTATCCTAAC ACATATAATT TTTAATATCC 
TTTCATACAC CTTACTTATT CTATTCATTT TTCAGGAAGG CATGnTAAT 
TAATTTTATG TGTACGAGTG TTTTGCCTAC ACAGTCATAG TGCATCGCAT 
TGCCCGTAGA GATCAGAAGG GAGCATTGGG TTCCCTAGGA CTGGAGGCAT 
GTGGGTGCAG AGAACTCAGC CTGGGTCATC TCAAAGCATC AGGTTCTTCT 
CACTTGCCAC TTCTCCCATT TACTGATTTT ATCTGTGTCC AGACATTCAT 
ACAGGTGGAA GTCAGGGACA ACCTATAGGA GTCAGTCCTC TCCTTCTACC 
CTCGCCTCAA ACTCAGGTTG TCGGGCTTCA TAGCAAGAGC TTCTATTTGT 
TGCTAGCCCC ACCCCATACT ATCTTTATAA TATCTGTTTA ATTAAGACAT 
TTTTATTAAC ATTCATCGTT ATCCCCTTTA CCAATTTTAC TATGTATTAA 
TTTAAATTTA ATTACTTCCT TGGCTGGGTT TTACAGGAGA GTTCCAGGAA 
GAGATGGCTC AACAGTTTAG AGCAACGGCT GTrCrrGCAG AGGACCTAGG 
GGCACTCAGA GGTGGCTCAC AATCATCTGT GACITCAGTT CCAGGGGATC 
TTCTGGGCTC CATGGGCATC AACTACACAC TTGGTTCATA GACATACATG 
ATTGATXrCAT ACATATGAAA TAAACCATAA ACAGAAAAAA AAAAGGAAGG 
AAAAAAAGTT TAAAAAAAGG AAAGGAAGGA AGGAAGGGAN NNNNWNNNNN 
NNNNNNNNNN KNNNNNNNNN NNKNTCTCTC CATACTGAAA GATGTCCACA 
GAATTTnTT TAAAAGACAA GCACAACGTT TTCTAGGGAT CAAACTCTAT 
GACTGGTGGT TTGAAGATTA CATAGCAGAG TTACATCTAA CATGAGCGTG 



AGGTTCCGCA 
GCCTAGCCGC 
TGCCTAGCCT 
TTGACTAACC 
CTGGTTATAC 
AGTCCATTTT 
CCATTTATTT 
CTCCACTGCT 
TGTCTTATCT 
TGAGTTTTTC 
GTTTTTTTTT 
ACATTTTTGC 
GAACCACCTT 
TGAGTCATCT 
GGCCCAGTCC 
GTGTGAGTCC 
TGAGCCATCT 
TCATAATGAA 
TTGCCACCCC 
GCTAGATGGA 
TTCAAGTCCT 
TGAAGAATTC 
CCAGCAAATG 
TGAGGGAAGG 
NNNNNNNNNN 
ATGACTAAGG 
TTGTGAGGAA 
TTrCCCCTGG 



420 
480 
540 
600 
' 660 
72 0 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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ATGGAAGCyVG TCTGATAACT TGTCTTTCTT TCTTAGTTAG CATCTCAGAG TCCCCCGCCT 
CCCITAACAT CCTTTTTGCA CACCATCTTT ITAGGAAAAT GGATCATTTA TGGGGATGTA 
GTGATTTGTA CAAGAATGTC CCCTGTGGGC TCAGATATTT GAATACTTAG TTCCCAGTTG 
GGGGAGCTTT TGTAGGGAGG TTGGGAGGCA-CAGCCTGGCA GGAGGAAGCA TGCTAGCAGC 
TTTGAGACTA TAAACCCTCA TCTACTACCT TGTTCTCTTr CTGCATTGTG CTGTGTCTGA 
CACTGTGAGA TTCCTGCTCC CGATGCCATG CCTGCCCGCC ATGATAGACT CCTAGCCCTC 
TGGAAAGGTA ACCTCAGTGA ACTCTCTTCT ATAAGTTTCT TTGCTCCTGG TGTTTTATCA 
CTGAAACGGA AAAGCTTGCA GGGAGGTAGG AGGCAGCCTG TGGCGTTGAT TCAATGCACC 
TGGCCTTATC CTCGGATGAG ATCGGTCACC AGTCAAAAAC TGTGAGCTTG AAGGTCrTGG 
GTGCTTAACA TCTATTnTA CAAATCTTAT TTAGCAACTT AGAACTGTGA AATATTGGAA 
AGCTACTTAA ACCTTCTAAA CTCCCTCCTC CACACTATGA GAATGTTACA TTTTCTATTC 
AGTTATTTTT GAGCAGTAAA CAGATGAATC AAGGAATATG CCCATCACAT CAAGAGTGCT 
CCTAAATGGA CTTGCTTGTT ATTCATTTAC AGTGTGGCCC CTTGACTTTC ATCGGCACTC 
CTAGCAGAAA ACAAAATCCG CCAGATGGAG CTGGAGAGAT GGCTCAGCTG TTAAGAATAC 
TTATCCCTAC ACAGGCCCTG GAGCCAGTTC CCAGCACCCA CACGGTGGCT CACAACCATC 
TGTAACTCCA GTTCTAGGAG ACCCGACTCC CTCTTCTGTC TGAAAACACC AGGCACGCGT 
GCGGTCTACA TACAAACATG AAAGCAAAAT ACACACATTA CATAAATAAA TCTTAAAAAA 
TGATTCGGGG TGGGGGAAGG AAAAAAAAGG ATCTTTAGAAA ATCGATGTAA CTGTTTTTTC 
CTTTTGCACA GATCTAAGTT AGGGAAGGAG AACATTCTCT TACCATCGAA AATAATTGTT 
TTCATTGCCC CCAAGTCTGC TAATAGAGCT TGCTACCTTC ATGGCTOTCG TAAGGATGAG 
GCAAAGATGG ACTTCAGCTT TCAGACTGTG TCTGCTCAAA TGITGGCTAC TCCTCTTTTC 
TGACCCCCTT CTCTGGTGCA ATCTGGACTT TCAATTAATT TCCCTGCATC TTTTACATAT 
TTGATTTAAA AAATATTTTA TTTTATGTAA TTGTATGTAT ATGCATGTCA ATAAGCATAT 
GTGTGTGTGT TTCCATGGAA ACCAAGGCAA CAGATTTTCC AGAGCTGTAG AAATGGGCTG 
TGAGACGCCC ACTGTGGGTG TTCGGAACCA AACTCGGGTC CTGTGGAAAG ACAGCGAGCA 
CCCATAATGC AGAGGTATCT CTCAGATTTT ACTTTAAAAT TTCAATTTTC TTTTTTTTTT 
T-TAAAGTTCC AAGTAACTAT AGGAAAGTAC ATGGGTATAT AGATCCCCAG TACCAAGATT 



2100 
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2220 

2280 

2340 

2400 
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2520 

2580 

2640 

2700 

2760 
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2880 
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3240 

3300 
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CTTCCTTTGC AGGTAGCACA ACTTGGTTTG TTTCACATAA AGAATGGAAA GTCATTAAAA 372 0 

CACTCATCAC ACTGTAAAGT AGAATTGAAC TCTGACAGAA CAAGCGAAGT GAGTCTGACT 378 0 

TCCAGGTAAC TGAGCCTTCT TTTCCTCCTA AAGACACAAG CCATACACAG AGTAAAATAA 384 0 

ACTTGGGCAT GGTGAGAAGG AAACAACGCA GGAGGGCTAG CCAAGTCTGA GAGTCGTGAG 3 90 0 

TGTGCTCGGT TTATAAACGG AGCCCACCTT GCCAGCGAGG TAGTCACATX3 CTCTGCTAAA '3960 

CAGAAACTTA AGAAAACACT TACACGAAGC AAACATGGGG AAGTGCCATG CAAGCATGTG 4 02 0 

ACTGACTGGT GGCAATGACC GAAACCACAG CAGCCACTAG AAAAGGAAGG GTAGTGCGCC 408 0 

ACACTGTAGT TGTGAAAATG AACTTATTCA TTTATTTTGA AAAACGTGTA AGAAGCAAAG 4140 

ATGTCTTCTT TCCCACCTAC CTTTGCGGCA GGCGAGCACT TCCTGGAATT TATAAAGTGC 42 00 

GATCTTTCTG GGGACTTCTC ATAACATTTC CTACTGCTCA TCTATGTCTG TGTCAAATAG 4260 

AGJ\ATGCTCT TGAACAAGTC TGTGTGTGTG TGTGTGTGCG CGCGCACGCG CACTCACTCC 4 320 

TGCTCTGTTG AGGTCCAGTT TTGATGGTCC CGCCAGAGGT ATATTTGAGT ATCATTTCTC 438 0 

AAGAGCTTCA GCTGGGAGAC ACTGCCTCTT ACTGGCCTGA AGGTCACTAG CTGATTCATC 4440 

TCCGTTTCGG CTGGCGCGCC TTGGGGATCC TCCTATCTCT CCTTCCCCAG TGCTGGGATA 4500 

ACAAGGTTGG CACCACATGA GCCTTTTAAA ATCTGAGITT GGAAGCTCAA ACGCAGGTTT 4560 

TCATGCTTGC ACTGAAACTT CACAAGCTGA ACCGTCTCCC TCTCCTTCCC TCTCTTTTTT 4620 

CCTTTTCTTC TTCCTTTTTA AAACACATCT TGTCTTTAAA AAAAAAAAAA GGCCCAAAAC 4680 

AAGTCTAAAG TATTTCCCTA TGTGTGTGGA GGGAGGGAGT ATAGGAGGCT GATTTCACTG 4740 

AGATCCTGTT AAATTTGGGT GCCATAGCCA ATCAAAGACG CATCGTTTCC TCTAAG^TT 4800 

CTAAATGGGG CGATTACCAC GGGCCTGCAG GTTCTGGrrT GTATTAGAGG AGACACTGTC 4860 
TTCITAAGTA AAACATAGAA GGGGAAGTGT CCAGAATTGT AAATAAGGCT TCGAGAGAAG 4920 
CCTTGTCTGG CCACCGGGAT GGAGAAGACC TACCTTCGCC TATCCAGGAT ' CCATCGTCCC 4980 
TCCCTCTACC CAGATCTGAC AGCCCTCCTT GGCTCrTTTG CTGAGGTTTG TTTGAGTTTG 5040 
TTTTACTCTC TGCAAGAGAA GTTTCCTTAA ACATTCTACC CTGTTCACAA GTAAATACAC 5100 
CTCTTAGCTA AGAGGCCACA CACCCAGGGG GAACACCGAT AAAAAGAACA AGCCAGAACC 5160 
TTCAGAACGC TGTCGATAGG TACACCAAGC AGCCTTCATA CGGAGTTTTC ATTCGTGAGG 522 0 
AGCTGAATAT ACAACAAAGC TAAATGTGAG CAGACCAGGC ATGCCTCTGC TAAATGAGGA 5280 
TGCCCACACC AAACATGCCC AAGATCTTCA AGTATAATTT TATTATATAG ATTCGCTATG 5340 
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TGTTGACATG 
TCTGGCACCA 
TTTTTTTTAA 
TCCGATGTAG 
TCATTCTAAA 
ATTTCCCAGG 
TCTCTAGTCG 
ACCGGTTTCC 
CCCCAAGCAT 
AGGGGACCGT 
CCGGGGGCTC 
TCCTTGCTAT 
AGGGCTAATC 
GGCGTTGGCA 
ATGCGGTGGC 
TTGTGCCCCG 
GATTTTTGGT 
CCTAATGAGG 
GCAAGTGTCT 
CCAGATTTGC 
GGCGTGGGAA 
GAGTCCCCAC 
TCTTTCTGCC 
GGACCGCTAC 
TGACCCGATT 
GCTCTCCCCG 
GGGACGGAGA 



TTTTTATAGT 
TACTTGAGGC 
AGAAAAGCAC 
AGACAATAAT 
TGCATTTGGT 
CGTGGAAGGA 
CTACTTGAAG 
TCTTTGTTCC 
ATCTTGGCTG 
GGTCATGGAA 
TTCTCTGTGC 
GGGTTTTCTG 
AGGTAACTTC 
ACCTGGATAA 
CAAAGCACCA 
AGTACATCTA 
CCCTTGGGAC 
GCGCAGTGGG 
CAGAGCCCAT 
TCTCAGATGC 
ACCGGGAAAC 
CCCTCCCGGT 
CTGAGTCCTC 
CCGGCAGGAC 
CCGCGGACAC 
GTCTTGCGCT 
AGGAGTCTGT 



GAACCTGGAT TTTACAAACC 
TTAGGCACGT GATAAAGGAG 
CATGTTACAT CATTAATCAT 
CTTATCTCTT--TGTCTGGCTG 
TTTTGCCAGG AGTAAAACAT 
AAGGAATGGA AAGAAAACCA 
TCTACATAGC TGGGGGGGGG 
TACACTGGCG CCTCTGGCAA 
AAAGGTCAGC TCTGAAAAGG 
CTGGGTAAAC AAAAGCACTC 
ATTTGCCCTG GAACCCTGAC 
GACCGACCCA GCCAGGAAGT 
GGACGATTTA AAGTTGCCAG 
GCGCCTATCT TCTAATTAAA 
TAAAACAAAA CTTCCAAGTA 
GGTTCAGGGG TTCTTGTCTT 
TTTCAGTGCA GCGGCGAAGA 
CCTCGTGTTT CTGGTGATGC 
TACTGGCTAC ATTTTACTTC 
GACTTGCCGC CCGGCACAGT 
CCAAAOCTGG TATCC7VGTGG 
AATGACCCCG CCCCCATTCG 
AGGACCCCAA GAGAGTAAGC 
TGAAAGCCCA GACTGTGTCC 
CGCTGCAGCC GCGGCTGGAG 
GCGGGGGCGC ATACCGCCTC 
GCCTGAGAAC TGGGCTCTGT 



CTCCTGGTTT GCCACCTGCT 54 00 

CATGCCTGTT TCCCCCCTTA 5460 

GCATATCAGT GTAGTTTAGA 5520 

AAAGACTGTC CTTTAAACTA 5580 

GTCACAAGAT ATTTGTTGTC 564 0 

GGGGTGAAGG CTGCTGTTCC 57 00 

GGGGGGACTG TTCACATGGG 5760 

AAAACTCTCC CTTCTCTTCC 5B20 

GGCCTGGCCA AAGTTACTGT 58 BO 

TAGCAGCCAC TGGAAAAGGA 5940 

CACCGCCAGC TCCCTGCATC 60 00 

TCACAACCGA AATGTCTTCT 6060 

ATGGACGAGA AAACAGTAGA 6120 

ACATTCAGAC GGGGCGGGGG 6180 

CT6ACCAACT CACTGCAAGT 6240 

CATCCTCCCA ACTGCGGGCG 6300 

GAGTTCTGCA CTTGCAGGCT 6360 

TTCCCAGGTT GCTGGGGGCA 6420 

CACCAGAAAC CGAGCTGCGT 6480 

TCCGGGGTAG TGGGGGAGTG 6540 

GGGGCGTGGC CGOACGCAGG 6600 

CTAGTGTGTA GCCGGCGCTC 6660 

TGTGTTTCCT TAGATCGCGC 6720 

CGCAGCCGGG ATAACCTGGC 6780 

CCAGGGCGCC GGTGCCCCGC 6840 

TGTGACTTCT TTGCGQGCCA 6900 

GCCCAGCGCG AGGTGCAGGA 6960 
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TGGAGAGCAA 
CTGTGGGTAA 
TGGGAATAGA 
TTCACTTACC 
TCTACTCTCC 
CTAGGTAGGC 
GAACAGGCGG 
ACTCGGTGCT 
CCCTTTAGCT 
ACACTTTCCT 
CGCGGCTTTC 
ACCGACTCCA 
GGCACCTAGT 
CACACTCGCT 
CTTCCCTTCC 
TCTTCCTCCT 
CTCCACCCAT 
TTGCGCCCTA 
CGCCTGTCTA 
CACGAGTGCC 
CAGCCTGGAA 
AGAATTCCTA 
XTTTGGAGTG 
AGACAGAGCC 
ATTAATTAAT 
TGTACCTCTT 
ATGAATGAGG 
ATTGATAACC 



GGCGCTGCTA GCTGTCGCTC 
GAAGCCCACT CTTTAGTAGT 
AGAGGACCTA ACTCGTAGAG 
AATGGGGAAA CTGAGGTTCA 
CCCGGGCATC GGTTGGAGCG 
AGGAGTGGAA AGGCGGCCTG 
TCAAACGCTG GGAAGCCAGC 
GTTGCCCACC GCCCTAGTGG 
CCAGCGTGTA AGCGCACGGA 
CCCTCAACTT TCTTTGATGC 
TCTTCTGTGA . CAGCTTCTCC 
AAGCGCTGGT CCTGGGGTAA 
TATAGGTCAC TTTTCTTG1T 
CTTGGGCTTT TGTGCAATAA 
CTTCCCTTCC CTTCCCTTCC 
CTATTTCTCT GTCATTTCCT 
TGGAGAATGG AGAATCAGGA 
AGAGAATCGC ATGGCTTAAA 
GAGCTATTGA GTTTTCCTCA 
CTCTTGTTCC TGAGAAGGCC 
CATGTGCTTG CCCTGGGTTC 
GTGTAGCAGC TTAACAGCGT 
TGTGACTTGC TTAATTGGAT 
GCTGTTGTTT TTCCTTCTGG 
GTTGGTTATC GGATTTGAAC 
GCAGTCAGAA TGAGCTGTCT 
TTTAATAAAG GCGATGGATG 
TCATTGACTG GATTTGATTG 



TGTGGTTCTG 
AAGGCGGAGA 
CTCTAGAGAC 
AAGACTCTTC 
TGTCCTGCGG 
AGCCGGGGCA 
TCAAGCCAAG 
GGGGCAGGGA 
CTATGTGAGG 
AGAATGCTAT 
AGGTGTATTA 
ACTCTGAAAG 
TTAAAATGCC 
TTTCCCTTCC 
CTTCCCTTTC 
TTTTGAAGCC 
AAAAAGAAGT 
AGCATTGGCA 
TAATOACTTG 
GCACTCTCCC 
CATAGAGAAG 
CCCGTTCTCT 
TGGGCTATAA 
TCTTTGAGCG 
ATAAAAGGGC 
AAAGAACAGA 
AGCATTAGTC 
CAGACATGTA 



CGTGGAGACC 
AGTAGGGTGC 
CCTCCTCCCT 
CGAAATGACT 
AGCCGTCACA 
GGAGATGCTC 
CGGCCCGGCT 
ATCCGCCTCT 
GTAGGTCTCT 
TTTTGCTGGT 
AACTAAATAA 
TCTCAGAAAC 
CTCTGCTTCA 
CTTCCCTTCC 
CCTCTTCCTT 
ACAGTTTGCA 
CAATTCTGCA 
CTGACATACG 
GTTCATCAGG 
CCTTTCTGGG 
CAAGTTGCTT 
OAATAAGATG 
TTGGTGCCAT 
GGAAGGATAA 
TTTTATTGTA 
ACCCAAACTT 
ACTOATGTAA 
TTGGTATGGG 



CGAGCCGCCT 
GGGCGGAGAG 
TGGGTGTTCT 
CAGCCAGGAT 
GCCCCTGGCG 
CCACTGGCAG 
GGCATCAATC 
GGCTCCGCTC 
TCATAGAGCA 
AGGAGGAAGA 
CTCTCCACTT 
TCTTGAGCTT 
AGGTTAGGCC 
CTTCCCTTCC 
TTCCTCCTCC 
GATTTCCAAT 
GAAACATTCC 
GCGCCAAGAT 
CTAGCTCCAC 
AAGAGAAAGA 
TAAAGCCCAG 
GAGGTTGCCC 
CCAAGTCTCG 
CAGTGCACAA 
TAGTAGCATA 
GCCGATGAAA 
ATCTCCAGTT 
GCATCCTTTA 



7020 
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7200 
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7440 

7500 

7560 
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7680 

7740 
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AACy\TGAGCA TAGCCAACGT GCCTGCACTC TAAOAGAATC TATGGCTQTA TGTTATTACA 87 00 

GAGACAGTTG AGAAGCTCTT AGTGGCTCTG GCGTGTAGAT CAGCGGTAGA GCGCTGAGGC 8760 

TCTGCGCTCG CTTCCTGGCA CTGAAGAATA AAGGCCATTT ACTGTGGTGG TGCAGTGGGC 8820 

GCAGTTTGTG ACQAGTTACT ACTACATTTT-CCTCACACAT CTGCCTGACT AATGAGTTCA 8880 

TCAGATGAGC GTATCCAGTG ATTGTlTGCA GGTTAATGGT TCTCAGTCAT GTTTAGAATC 8940 

TACTTATCAA ACAAATTGTT ITCTCATTTC CTGCTTCTTC TCAAACAAAG TAAGATTCCA 900 0 

TTATTGAAAG GCTTGTTTAA GAGCATTTTA ACTGCTTGCC TATGTTAGGG ACAGTGACTT 9060 

ATTTCATATT GACAAATATT ATGCCGATTA ATTGAATATG ACTACCCAGT TCTATAGCTG 9120 

TCTCAGGGCA GACCAAGAGC ATCTGTGATC CAGTCACTTT AAATGCCATT TAAAATGCAT 9180 

AATTTGTTGG TCTAGGAATA AACACACTGT AAAGTTTAGA ATCACGGCCC AAACACAAGT 924 0 

CTTTAACAAT GCCAACTAGC TTCTGAGATT CATTAATGTC ATTTAATTAC CAATGTTTTA 9300 

AAAATATGTC ATTAATTACT AAATCTATAG ITGTAACAGC AACACATGTA CATCTTATTA 9360 

AGTTGGGTAT ATTCAGGGTG GCATAGCTGT AGACTATTGC ACATCTGTGT TGGTGAGCCA 9420 

GTGGAGAACT GCCTCCTGGC TGTTCTCAGA AGGCCACAGT GTCACGGCAT TGGCTATTTG 9480 

CCrTGGCrCT TTGCTAATAC TTTATTGACA TGGCCTCATC TTCGTTCACG TTCACTTATT 9540 

TGCCCAACAA CGTCAATGCC AGCTGAGGCC TTAGGAGTCA TCTGTTCTTA GTCAGTGCGA 9600 

ATTAGAAAGC CTGGATGCCT GCCTGCTATT AATTAGTTAT TCTTCTCTTC TGAGACAGAG 9660 

TCTCACTGTG TGGCCCAGGC TAGTCTCAAA CTTGCGGTCC ATTTGTCTCA CTCATCAGAA 9720 
TGCTGGGCTT CCAGGTGTGT GCACCACACT AGGTAGCTCG CGTTTTAAGC TAAGAGCTGG 9780 
AAGATCCTGA TGTCCTTTAC CATGGTGGGC ATGTTACAGG TTAGTTGACT GAAAACTAGT 9840 
TATCTCGCTG TCTAATGACC TGCAGTGGTA TGTATCTCTC AAGATGCTTT TTTGCATTTC 9900 
AATCAGTTAG GTAACAAGIT CTTAAGTCTC CAGCTTGGTA TTGGCATGAG CTCAGAGCTT 9960 

TGATTAATGA GTTGGGACCC CCTAGCTATT GCTCATTAGA CTTACACTAT TTTTAGTTTT 10020 

GCTCTGAGTT TATCAATATG CATGTATGCA TGAACTXGGG AGATATTTTT CTTCCCCAAT 10080 

TCCTTTTCCT CCATTTAAAT GTGCTGTCTr TAGAAGCCAC TGCCTCAGCT TCTGCAGCTC 10140 

AGATACCAAA GGAAGTCT6G TACACAGCAT GATAAAAGAC AATGGGACGG GGTCACAGTG 10200 

GCTCCCGTCC CTTTCAGGGG TATGGAGACG AGCTGTAGAG AGATGTCTCC AGGGAGTTTT 10260 
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CATTAATCAG CAATTTAGTC AGATCTGTGC ATCCTATGCT TTACAAGAAA TGTCAGTGGG 10320 

CCTGAGATCA TCAGATGGAG GTTCATCGGG TTTCAATGTC CCGTATCCTT TTGTAAGACC 103 8 0 

TTCAAGTTGG CAACGCAGGA AAACAGGAAC TCCACCCTGG TGCCGTGAAT TGCAGAGCTG 10440 

TTGTGTTGGT TTGTGACCAT CTGCCCATTC TTCCTGTTAT GACAGAGCTT GTGAACTTTA 10500 

ACTGGGACTG GGGCAAAGTC AATCCCACCT TTATACAATG AATTGCTGAA GAGGCCTTTT 10560 

AAAACTTGGA GTGTGCATTG TTTATGGAAG GGCTTTCCTA TTGGATCCAA CTCTTTTCTA 10620 

ATTTGTrrCT AGGTTTGCCT GGCGATTTTC TCCATCCCCC CAAGCTCAGC ACACAGAAAG 10680 

ACATACTGAC AATTTTGGCA AATACAACCC TTCAGATTAC TTGCAGGTAA GGATTCCTTT 10740 

TTGAGCCAGC TTTCCTATGT GAAAGGACTC ATTGTTTACT GAGGTCACAA CAATTTCCAC 10800 

TATTGCAGAA GTATAATAGT ATTGTTACAA TTGTTTATAA ATCATGAGAC TTCTAAGAAC 10860 

CTATTTAATA ATGAAACAAT GGAAAAAGTC TTTTCAAACC TTTGTACTCT TTTGCTGAGC 10920 

CGTTTTCAAC ATGCACAAAC ATATTACACA AATATAACAT ACACAGGAAC ACACATGAAT 10980 

GCATCGGATG ATGTGCCTAA AACTAGCATG TAATTX3ATAT TCACAATTAT TGATAAATTA 11040 

GTAAAGCAAA GGAATTCCTT ATGAATAGAG CTAAAATTCT ATCCATGTTC AAGTCACCCA 11100 

GAATGGCTTC TGGACATTTT TTTTTTTAGC TGTTTTCTAC AAGTGAAATT CTGCCTGTAT 11160 

TAGCAATTTA ATATCTAGCC AATAATAITC CTGACCATAT GTCCTGTTCA GACCATGACC 11220 

TTCATAATCT GGCTTGATGT TCrGGGCTTC TTTCCCTCTT GCCAGCAAGA TGTCACGGTG 11280 

TTGATGCTGG ATAAACTGAG AAACAGAAGT TTTTCGCAAG AAGAGGACCT TGAATTTTGC 11340 

TTTTCCCCTG AGAGACAAGA AAGGAAACTT AGAGGAGGTG TAGCTGGGAG TGTGGTCATT 11400 

CATGAAAGAC CTGTTTGCAG GGCAGTGTGT TTTGCTGGGG ACAGTAATGA GCCTAGATCG 11460 

TAGTGCCATC CCAAGAGAGT GCTTGGTGGC AAAAAGAGCC CTAGCAGCTT GTGGCAGTTG 11520 

CCTCATATTT GAAGAATACT AAGAGGTCCC CCGAATAACT CAGGGCTAGT GTTGATCATT 11580 

GCATGTGGAG AGAATCCAAG CCTCCTATCT AGGGTCTACA AAAGTAACCA ATGCCCAGTC 11640 

TTTGGGGG/VA AGCAAAACCA GAAAGCGATG ATAGCAGGAC CTGTTTATTT TCATTAAGTC 11706 

ATGGCATTTC CAGAGACTTT GCTCCCCCTA TTCTCAGACA CAAAGCCCAC TTAAGATCTC 11760 

CCTCTGGAGA CTGCTGGGAA CATTTCTTAA GTTCTGAAAA AACCCTGGAG T6ATTGGGCA 11820 

CAGACGATCC TGTCACTTCA TGTGAGTGCT AAGCTCTTTG GGTGATGACT CAGTGGGTCA 11880 

CATTGTTTTA TTCATATTGA CTACCTTCCG TTTGCITTGC GGAGAATGGA AGCTATAGAA 11940 
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rJTCTGTTTGG 


TGTGGCCCTC 


ACAAGGCACT GTOAGCTTCT 


TCTCTCTGTG TGCTAACTTC 


12000 


TT^ AP TCCC 


TTGCTTATAC 


CCACATAGGG ACTCTGGCTT 


TGTTGCTGTT CTTCAATGCT 


12060 


Tr* H fl ATOTG C 


CCTGGGTCCT 


GTCTGTCCTT CACACTTACT 


GATGCTGCCT GGAATGCTAT 


12120 




GTGCATAGGG 


CCAGCTCGGT-CCAAATCCTC 


TCTTTTCTTT GCCTCTTTTA 


.12160 




CACAGTATCA 


AATCACCACA GTTTATGCAA 


CAAACTGAAA CTTTAAAATT 


12240 




TTATATTAGT 


GATAGGTTCC AGAAAGGCAC 


TGATTTTTTT TCITCCCTGG 


12300 




GCAACTACTC 


TACCACTGAG CGTGATATCC 


TTGGTCCCTT AAAAGTTATC 


12360 




AATAATGCTT 


AGCAATCATA TTTGCTTAAA 


ATATTTATTG AATGACTGCA 


12420 


GGAATGAATG 


AATGAATGAG 


CTAACAGAAA ACTCATGACC 


ATGTGGGTGA TTTCCGAAAC 


1 1 >i o n 


AGAGTGTGAG 


ATCTTTGGTG 


GCATGTCCTT GTAGACTGTC 


TGCCACCAGT ATCTATCATC 


12540 


TTGAAGGTGA 


CTATTGAGTA 


GITTATATGC ATGTGAAAAA 


CCAAACCTTC TATTCTCTTA 


12600 


CTCATAGCCT 


CTCTTAATCA TAGCCCTGTG GCATGGAGTG 


TACCATTGAT ATCTTCCTGG 


12660 


AATACTTTTT 


CAGGGGACAG 


CGGGACCTGG ACTGGCTTTG 


GCCCAATGCT CAGCGTGATT 


12720 


CTGAGGAAAG 


GGTATTGGTG ACTGAATGCG GCGGTGGTGA CAGTATCTTC TGCAAAACAC 


12780 


TCACCATTCC 


CAGGGTGGTT 


GGAAATGATA CTGGAGCCTA CAAGTGCTCG TACCGGGACG 


12840 


TCGAC 








12845 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDBDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "oligonucleotide" 

(iii) HYPOTHETICAL: YES 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



GGGGTACCGA ATTCTAAATG GGGCGATTAC C 



31 
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Example 11:The role of HIF-2a In Flk-1 regulation 

The Flk-1 promoter (-640 bp/+299 bp) confers endothelium-specific expression to 
the firefly luciferase reporter gene in transfected bovine aortic endothelial cells 
(ROnicke, 1996) and provides for a strong reporter gene transcription in vivo; see 
Examples 6 to 10. This suggests that transcription factors that are specifically 
expressed in endothelial cells activate the Flk-1 promoter in a cell-type specific 
manner. 

The basic helix-loop-helix PAS-domain transcription factor, HIF-2a (also known as 
HLF. HRF or EPAS1), is expressed in endothelial cells during mouse embryonic 
development (Ema. Proc. Natl. Acad. Sci. USA 94. 4273-4278, 1997; Flamme, 
Mech. Dev. 63 (1997). 51-60; Tian, Genes Dev. 11 (1997), 72-82) and is thus a 
candidate to regulate Flk-1 expression. HIF-2a has previously been shown to 
stimulate both the expression of VEGF (Ema. 1997) and 77e2 (Tian, 1997). To 
detemiine If HIF-2a might be involved in the regulation of Flk-1 gene expression. 
A293 cells were co-transfected with a luciferase reporter gene constmct containing 
Flk-1 promoter sequences (-640 bp to +299 bp) and an eukaryotic expression vector 
that contained the mouse HIF-2a cDNA. Mouse HIF-2a and HIF-1a cDNAs were 
obtained from a mouse brain capillary endothelial cell cDNA library (SchnOrch. 
Development 119 (1993). 957-968) with a 300 bp BamHI/Ncol fragment spanning 
the 5'UTR of HIF-1a. Positive phages were rescreened, and inserts were amplified 
by PGR using oligonucleotides HIF Start (5'-GGGAATTCACCATG 
AGTTCTGAACGTCGAAAAG-3'; SEQ ID NO: 18) and HIF Flag Stop: (5'- 
AAGCGGCCGCTCATTTATCGTCATCGTCCTTGTAATCGTTAACTTGATCCAAAG 
CTCTG-3'; SEQ ID NO: 19). The PGR product was digested with Eco Rl and NotI 
and subdoned In the EcoRI and NotI restriction sites of pcDNA3 expression vector. 
The murine HIF-2a cDNA was obtained as described (Hamme, 1997). The phage 
insert was amplified by PGR using oligonucleotides HRF START (5- 
GGGAATTGACCACAATGAGAGGTGACAAGGAG-3': SEQ ID NO: 20) HRF rev (S"- 
AAGGGGCGGCTGATTTATCGTCATCGTCCTTGTAATGGTTGGTGGCCTGGTCCA 
GAGGTGTGAG-3'; SEQ ID NO: 21) and PGR product was digested with EcoRI/NotI 
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and cloned into the EcoRI and NotI sites of pcDNA3. The sequence encoding the 
FLAG epitope was included in the reverse oligonucleotide primer. HIF-2a and HIF- 
1a expression plasmids were constructed by inserting the FLAG-tagged cDNAs into 
the EcoRI and NotI sites of pcDNA3 (Invitrogen). For co-transfection assays, A293 
cells were split 1:2 into 35 mm dish^ and transfected 18 hours later with 4 of 
DNA (2 ng of Flk-1 promoterndriven luciferase plasmid, 1 ng of CMV promoter-driven 

P- 

galactosidase expression plasmid, and 1 Mg of the HIF-2a or HIF-1a expression 
plasmids, or pBluescript SKII and pcDNA3 as a control) using a transfection l^it 
(MBS, Stratagene). After 20 hours, reporter gene activity measurements were 
performed using the Dual Light Kit (Tropix. Bedford, MA). The luciferase activity of 
each extract was normalized to the respective p-galactosidase activity. Endogenous 
background levels of both enzyme activities were measured using extracts from 
mock-transfected cells and were substracted. The nomialized luciferase activity of 
the control transfection was arbitrarily set to 1 . Each value represents the average of 
at least six experiments. 

In comparison to cells transfected with the luciferase reporter construct alone, co- 
transfection of the HIF-2a constnjct increased reporter gene activity approximately 
15-fold (Fig. 15). In contrast HIF-1a , a dose relative of HIF-2 which stimulates the 
hypoxia-induced transcription of the VEGF gene, failed to stimulate the reporter 
construct (Fig. 15). These results suggest that HIF-2a regulates the expression of 
the Flk-1 gene. 

Summary 

The mouse Flk-1 receptor is cmcial for the differentiation of the hemangioblastic 
lineage and during embryonic vascular development (Risau, Annu. Rev. Cell Dev. 
Biol. 11 (1995), 73-91; Shalaby. 1995; Risau, 1997). Moreover, Flk-1 plays a central 
role in the regulation of neovascularization in a wide variety of tumors (Plate. Brain 
Pathol. 4 (1994) . 207-218; Ferrara, 1996). To elucWate the basis of Its endothelial 
expression, regulatory sequences f the murine Rk-1 g ne have been isolated and 
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Characterized that confer endothelium-specific reporter gene expression in 
transgenic mouse embryos. Transgene expression driven by these sequences was 
strong, specific, and highly reproducible. Most importantly, it has been demonstrated 
that the isolated sequences were active in early stage vascular development and 
may thus represent a clue towards'the identification of the molecular mechanisms 
Involved in hemangioblast differentiation and vasculogenesis. Moreover, transgene 
expression persists until shortly after birth and is downregulated in adult animals, as 
it was described for the endogenous Flk-1 gene (Millauer, 1993; Kremer, 1997). 
Endothelium-specific expression in almost all transgenic mouse embryos tested was 
mediated by a 939 bp fragment of the promoter region In combination with a 
fragment of the first intron. 5'-flanldng fragments up to -5.5 kbp alone were not 
sufficient to confer a reproducible endothelium-specific transgene expression. 
Reproducible endothelium-specific expression was therefore dependent on 
sequences from the first intron. These sequences also activated the heterologous tk 
promoter specifically In endothelial cells in vivo, and were active in an orientation 
independent manner. Thus, they fulfill the criteria for an autonomous tissue specific 
enhancer. 

As demonstrated in Example 8. the intron sequences that were sufficient for 
endothelium-specific expression were contained in a 510 bp fragment. Several 
potential binding sites for known transcription factors could be identified therein (see 
Figure 12), including consensus binding sites for c-etsl. PEA3 (an Ets-like 
transcription factor), GATA transcription factors, and Scin'al-I. The c-etsl 
transcription factor was proposed to be involved in the eariy differentiation of 
endothelial cells from their precursors (Pardanaut, Cell • Adhesion and 
Communication 1 (1993). 151-160). In addition, c-etsl is expressed in endothelial 
cells during tumor vascularization and other forms of angiogenesis in humans 
(Wemert. Am. J. Pathol. 140 (1992), 119-127). Proteins of the Ets family can 
activate transcription through a PEAS motif (Wemert. 1992). Transcription factors of 
the GATA family are involved in the transcription of genes that are expressed in the 
hematopoietic and ndothefial lineages, such as von Willebmnd factor (Jahroudi. 
Mot. C II. Biol. 14 (1994). 999-1008). Unlike the hematopoietio-transcription factor 
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GATA-1 . GATA-2 is expressed in both the endothelial and hematopoietic lineages 
(Elefanty. Blood 90 (1997). 1435-1447). Scl/Tal-1 has recently been implicated in the 
regulation of Flk-1 expression in Zebrafish (Liao. Genes Dev. 12 (1998). 621-626). 
The presence of two potential Scin"a|.1 binding sites in the murine Hk-1 intron 
enhancer suggests that Sd/TaH might regulate Flk-1 expression in mice. However, 
no direct effect of Scin^al-I on Flk-1 expression has been observed so far in mice, 
although Scl-null mice have vascular defects (Visvader. Genes Dev. 12 (1998). 473- 
479). 

Recently, analyses of the regulatory elements of other endothelium-specific genes 
such as von Willebmnd facfor (Aird. Proc. Natl. Acad Sci .USA. 92 (1995). 4567- 
4571). c-efs-f (Jorcyk. Cell. Mol. Biol (Noisy-le-grand) 43 (1997). 211-225) or the 
endothelial receptors. Tiel (Korhonen. Blood 86 (1995). 1828-1835) and 77e2 
(Schlaeger. Development. 121 (1995). 1089-1098; Schlaeger. 1997) have been 
reported. The most uniform expression pattern reported was conferred by regulatory 
elements of the 77e2 gene. However, in contrast to Flk-1. expression of 77e2 and of 
reporter genes driven by 77e2 regulatory sequences is not downregulated in adult 
animals. Such as in the Hk-1 gene, the first introns of the 7fe2 gene and of the Ets-1 
gene are involved in endotheliunv-specific expression. Similar to the Hk-1 intron 
enhancer, the first intion of the TiB2 gene also contains an autonomous endothelial 
specific enhancer. A major difference between the stmctural organisation of the 
regulatory elements of the Flk-1 gene and the 77e2 gene is. however, that the 77e2 
promoter by itself Is active In certain embryonic blood vessels (Schlaeger, 1995), At 
least during the developmental stages analyzed (i.e. E10.5 and El 1.5) an 
autonomous function of the Flk-1 promoter was not observed. The Intronic 303 bp 
Tie2 core enhancer also contains potential binding sites for transcription factors of 
the Ets and GATA families (Schlaeger. 1997). and c-etsi or PEA3 binding sites are 
present in the promoters of Tiel, Tie2 and FH-1 (Kothonen. 1995; Schlaeger. 1995; 
Wakiya. J. Biol. Chem. 271 (1996), 30823-30828). 

Analysis of Flk-lllacZ knock-in mouse embryos that express the lacZ gene from the 
endogenous Flk-1 I cus has previously shown that the ladZ. reporter gene is 
expr ssed ubiquitously in the developing intra-embryonic vasculature and the yolk 
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sac Of E7.5 embryos (Shalaby, 1995). However, in accordance with the present 
invention it was found tliat a fragment of the 5' UTR that is deleted in the knock-in 
construct is required for reporter gene expression in the yolk sac vasculature during 
later stages of embryonic development. Based upon transient transfection analyses 
in bovine aortic endothelial cells/the Flk-1 5'-UTR has been shown to contain a 
positive acting, endothelial cell-specific element between nucleotides +136 and +299 
(Rfinicke, 1996). The complete vascular staining of the Flk-HlacZ knock-in embryo 
proper at E11.5 derrmnstrates that the 5 -UTR is not essential for intraembryonic 
vascular expression at this developmental stage. 

The involvement of HIF-2a in the regulation of Flk-1 expresston further emphasizes 
the role of basic helix-loop-helix/PAS-domain transcription factors in the regulation of 
components of the VEGF signal transduction system and of vascular development. 
The upregulation of VEGF in response to hypoxia is generally thought to be 
mediated by HIF-1. Moreover, mouse embryos lacking functional genes for HIF-1a 
or ARNT show defects in vascular development, perhaps due to reduced VEGF 
levels (Maltepe, NaturB 386 (1997). 403-407; Iyer, Genes Dev. 12 (1998). 149-162). 
This observation indicates that the physiological relevance of these transcription 
factors is not restricted to adaptation to hypoxia, but extends to the regulation of 
normal vascular development. HIF-2a is expressed in various tissues, including the 
developing endothelium of several organs, for example in the brain (Flamme, 1997). 
It seems therefore likely that HlF-2a is involved in the regulation of Flk-1 expression 
in blood vessels that co-express H/F-2a and Flk-1. Interestingly, H/F-2a Is also 
expressed in tissues that express the Flk-1 receptor ligand. VEGF, and has been 
shown to stimulate VEGF expression (Ema, 1997). Taken ' together, these 
observations support the hypothesis that HIF-2a is both an intrinsic and extrinsic 
regulator of blood vessel growth and function (Flamme, 1997), by stimulating, both 
receptor and ligand expression. The expression of VEGF and Hk-1 shows a 
remaricable coordinate temporal pattem both in development and in tumors. For 
example. VEGF and Rk-1 are expressed transiently in the developing mouse brain, 
and are largely down-regulated in the adult, but reactivated in brain tunfX)rs (Plate. 
1994). In hemangtoblastomas of the brain, which are highly vascularized tumors. 
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both the VEGF and F\k-1 expression are highly up-regulated, and this correlates with 
the up-regulation of H/F-2a expression in the stromal cells of this tumor type. 
Whether HIF-2a contributes to the remarkably coordinated expression of VBGF and 
F!k-1 in other tumor types, remains to be established, since for example in 
glioblastomas - another cerebral tumor - up-regulation of VEGF is due to hypoxia, 
and HIF-2a is inducible by hypoxia. Unlike the expression of VEGF and Ffl-t, Flk-I 
expression is not directly stimulated by hypoxia (Gerber, J. Biol. Chem. 272 (1997)» 
23659-23; Kremer, 1997). Thus, the primary function of HIF-2 in the regulation of 
Fik'1 expression does not appear to be related to the hypoxia response. 
Among the endothelial RTK identified thus far, Flk-1 is the only receptor whose 
function is required for the determination of the endothelial lineage. Therefore, the 
Flk'1 gene represents the ideal candidate for studying the transcriptional regulatory 
mechanisms that are active during the emergence of the endothelial lineage. The 
observation that the isolated regulatory elements of the Flk-I gene are active in eariy 
stage vascular development are of great importance for this objective. Knowledge of 
the Ftk-1 gene regulatory sequences is also of great potential relevance in the 
therapy of certain diseases. The Flk-1 receptor has been demonstrated to be a key 
regulator of angiogenesis in various diseases, including cancer (Plate, 1994), 
Therefore, the study of the regulatory elements involved in the upregulation of Flk-1 
expression in the tumor endothelium appears to be particulariy relevant for studying 
the mechanisms of tumor angiogenesis. Further studies will unravel whether the 
same regulatory elements of the Flk-1 gene that confer endothelium-specific 
expression in mouse embryos are also active in the tumor vasculature. Flk-1 gene 
regulatory elements active in the tumor vasculature may provide ihfomiation about 
the signaling pathways that can be targeted for anti-angiogenic tumor therapy. 
Finally, the Flk-I gene regulatory elements will be useful for targeting expression of 
genes to the vasculature. An attractive possibility is the expression of suicide genes 
(Ozakl, Hum. Gene Ther. 7 (1996). 1483-1490) under the contrcil of these elements. 
The use of the Flk-1 gene regulatory elements in combination with, e.g., the 
Cre/loxP system may provide a powerful tool for specifically inactivating genes in the 
developing vasculature or in tumor endothelium. 
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The present invention is not to be limited in scope by its specific embodiments 
described which are intended as single Illustrations of individual aspects of the 
invention and any DNA molecules, or vectors which are functionally equivalent are 
within the scope of the invention, indeed, various modifications of the invention in 
addition to those shown and described therein will become apparent to those sl^iiled 
in the art from the foregoing description and accompanying drawings. Said 
modifications intended to fall within the scope of the appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

<i) APPLICANT: 

(A) NAME: Max-Planck-Gesellschaf t zur Foerderung der 

Wissenschaften e, V. 

(B) STREET: none 
<C) CITY: Berlin 
(D) STATE: none 

(B) COUNTRY: Germany 

(F) POSTAL CODE (ZIP) : none 

(ii) TITLE OF INVENTION: Regulatory Sequences Capable Of Conferring 
Expression Of A Heterologous DNA Sequence In Endothelial 
Cells In Vivo And Uses Thereof 

(iii) NUMBER OF SEQUENCES: 21 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE; Floppy disk 
<B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D> SOFTWARE: Patentin Release #1,0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12B45 base pairs 

(B) TYPE: nucleic acid 
<C) STRAHDEDNESS : single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TCTAGAATAT AGAAGATAAG TTTGCGTACA ATTCAGTCCT TTGAAGACCT GATAAGCTTIT 60 

AAGAAGGAAG ATGGGT-TACA CATTGGGAAA TGGTTGCAAT CTGCACATGG CAGAGGCAAG 120 

AGATGCAAAT CACATTTCTT ACATACTCCA TACAAATCTT ACAAGACTGT TTTTCTTTCT ISO 

CATTTAAAAT AAGAAGACCT GCCAGTCTTC CCCTTATTAC TAATTACAGT CACTCTGTAT 240 

CTTTGTTGAC ATTGGATAGT TTTACATACT TCAACAGGCT GGTGTCATTA AAGTTGTGGT 300 

GGOTGGGCAC CAGAGACACG TGATTCAGAG TGGGAGOAGA TGCAGGAGAA ACGAGGCACA 360 
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GCAGAAGCAG AAGCGAGGAA AAACACTCTC AACGTTACTA ACACATCGAG AGGTTCCGCA 420 

CACTAGCAAT ACGGGCTGAA TCTGACCTAA TCTCTGCTGT TGAAAATTTT GCCTAGCCGC 480 

ACACTAGCAA TACGGGCTGA ATCTGACCTA ATCTCTGCTG TTGAAAATTT TGCCTAGCCT 540 . 

GTCACACAAG TGCTGAGCAT ACAGAAAAAG^GAGAGTAATT CTCTGGTTCT TTGACTAACC 600 

AAATAGTCTA TATCAAATTG CCTAAGATAA TGTATACATT TAGTACATGA CTGGTTATAC 660 

CTATTCTATA TGACTATTAT TTAAATGTGA ATTTACAAGT GAGCATATGA AGTCCATTTT 720 

ACATGGCTAG TACATATAAC TTTTAAAAAG TTGGACATAG rTATATTTTT CCATTTATTT 780 

ATTTACTTTA TATCCTGATC ACAGACCCCC CCCTCCTCTG GATTAACTCT CTCCACTGCT 840 

TCTTACCCCT CCCCATCTCT CCTTCACCTC TGAGAAGGGG GGATACCTCC TGTCTTATCT 900 

GGTTTCAGTG GGAGAAGGAT GTATCCTAAC ACATATAATT TTTAATATCC TGAGTmTC 960 

TTTCATACAC CTTACTTATT CTATTCATTT TTCAGGAAGG CATGTTTAAT GTTTTTTnT 1020 

TAATTTTATG TGTACGAGTG TTTTCCCTAC ACAGTCATAG TGCATCGCAT ACATTTTTGC 1080 

TGCCCGTAGA GATCAGAAGG GAGCATTGGG TTCCCTAGGA CTGGAGGCAT GAACCACCTT 1140 

GTGGGTGCAG AGAACTGAGC CTGGGTCATC TCAAAGCATC AGGTTCTTCT TGAGTCATCT 1200 

CACTTGCCAC TTCTCCCATT TACTGATTTT ATCTGTGTGC AGACATTCAT GGCCCAGTCC 1260 

ACAGGTGGAA GTCAGGGACA ACCTATAGGA GTCAGTCCTC TCCTTCTACC GTGTGAGTCC 1320 

CTGGCCTCAA ACTCAGGTTG TCGGGCTTCA TAGCAAGAGC TTCTATTTGT TGAGCCATCT 1380 

TGCTAGCCCC ACCCCATACT ATCTTTATAA TATCTGTTTA ATTAAGACAT TCATAATGAA 1440 

TITTATTAAC ATTCATCGTT ATCCCCTTTA CCAATrrCAC TATGTATTAA TTGCCACCCC 1500 

TrrAAATTTA ATTACTTCCT TGGCTGGGTT TTACAGGA6A GTTCCAGGAA GCTAGATGGA 1560 

GAGATGGCTC AACAGTTTAG AGCAACGGCT GTTCTTCCAG AGGACCTAGG TTCAAGTCCT 1620 

GGCACTCAGA GGTGGCTCAC AATCATCTGT GACTrCAGTT CCAGGGGATC TGAAGAATTC 1680 

rrCTGGGCTC CATGGGCATC AACTACACAC TTGGTTCATA GACATACATG CCAGCAAATG 1740 

ATTGATCCAT ACATATGAAA TAAACCATAA ACAGAAAAAA AAAAGGAAGG TGAGGGAAGG 1800 

AAAAAAAGTT TAAAAAAAGG AAAGGAAGGA AGGAAGGGAN NNNNHNNNNN NNNNNNNNNN i860 

NNNWNNNNNN KNKNNNNNNN NNNKTCTCTC CATACTGAAA GATGTCCACA ATGACTAAGG 1920 

GAATTnTTT TAAAAGACAA GCACAACGTT TTCTAGGGAT CAAACTCTAT TTGTGAGGAA 1980 

GACTGGTGGT TTGAAGATTA CATAGCAGAG TTACATCTAA CATGAGCGTG TTTCCCCTGG 2040 
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ATGGAAGGAG TCTGATAACT TGTCTTTCTT TCTTAGTTAG CATCTCAGAG TCCCCCGCCT 
CCCTTAACAT CCTTTTTGCA CACCATCTTT TTAGGRAAAT GGATCATTTA TGGGGATGTA 
GTGATrrGTA CAAGAATGTC CCCTGTGGGC TCAGATATTT GAATACTTAG TTCCCAGTTG 
GGGGAGCTTT TGTAGGGAGG TTCGGAGGO^'^CAGCCTGGCA GGAGGAAGCA TGCTAGCAGC 
ITTGAGACTA TAAACCCTCA TCTACTACCT TGITCTCTTT CTGCATTGTG CTGTGTCTGA 
CACTGTGAGA TTCCTGCTCC CGATGCCATG CCTGCCCGCC ATGATAGACT CCTAGCCCTC 
TGGAAAGGTA ACCTCAGTGA ACTCTCTTCT ATAAGTTrCT TTGCTCCTGC TGTTTTATCA 
CTGAAACGGA AAAGCTTGCA GGGAGGTAGG AGGCAGCCTG TGGCGTTGAT TCAATGCACC 
TGGCCTTATC CTCGGATGAG ATCGGTCACC AGTCAAAAAC TGTGAGCTTG AAGGTCTTGG 
GTGCTTAACA TCTATTTTTA CAAATCTTAT TTAGCAACTT AGAACTGTGA AATATTGGAA 
AGCTACTTAA ACCTTCTAAA CTCCCTCCTC CACACTATGA GAATGTTACA TTTXX:TATTC 

AGTTATrrrr gagcagtaaa cagatgaatc aaggaatatg CCCATCACAT caagagtgct 

CCTAAATGGA CTTGCTTGTT ATTCATTTAC AGTGTGGCCC CTTGACTTTC ATCGGCACTC 
CTAGCAGAAA ACAAAATCCG CCAGATGGAG CTGGAGAGAT GGCTCAGCTG TTAAGAATAC 
TTATCCCTAC ACAGGCCCTG GAGCCAGTTC CCAGCACCCA CACGGTGGCT CACAACCATC 
TGTAACTCCA GTTCTAGGAG ACCCGACTCC CTCTTCTGTC TGAAAACACC AGGCACGCGT 
GCGGTCTACA TACAAACATG AAAGCAAAAT ACACACATTA CATAAATAAA TCTTAAAAAA 
TGATTCGGGG TGGGGGAAGG AAAAAAAAGG ATGTTAGAAA ATCGATGTAA CTOTTTTTTC 
CrrTTGCACA GATCTAAGTT AGGGARGGAG AACATTCTCT TACCATCGAA AATAATTGTT 
TTCATTGCCC CCAAGTCTGC TAATAGACCT TGCTACCTTC ATGGCTGTCG TAAGGATGAG 
GCAAAGATGG ACTTCAGCTT TCAGACTGTG TCTGCTCAAA TGTTGGCTAC TCCTGTTTTC 
TGACCCCCTT CTCTGGTGCA ATCTGGACTT TCAATTAATT TCCCTGCATC TTTTACATAT 
TTGATTTAAA AAATATTTTA TTTTATGTAA TTGTATGTAT ATGCATGTCA ATAAGCATAT 
GTGTGTGTGT TTCCATGGAA ACCAAGGCAA CAGATTITCC AGAGCTGTAG AAATGGGCTG 
TGAGACGCCC ACTGTX3GGTG TTCGGAACCA AACTCGGGTC CTGTGGAAAG ACAGCGAGCA 
CCCATAATGC AGAGGTATCT CTCAGATTTT ACTTTAAAAT TTCAATTTTC TTTTTXTTTT 
T-TAAAGTTCC AAGTAACTAT AGGAAAGTAC ATGGGTATAT AOATCCCCAG TACCAAGATT 
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CTTCCTTTGC 
CACTCATCAC 
TCCAGGTAAC 
ACTTGGGCAT 
TGTGCTCGGT 
CAGAAACTTA 
ACTGACTGGT 
ACACTGTAGT 
ATGTCTTCTT 
GATCTTTCTG 
AGAATGCTCT 
TGCTCTGTTG 
AAGAGCTTCA 
TCCGTTTGGG 
ACZW^GGTTGG 
TCATGCTTGC 
CCTTTTCTTC 
AAGTGTAAAG 
AGATCCTGTT 
CTAAATGGGG 
TTCTTAAGTA 
CCTTGTCTGG 
TCCCTCTACC 
ITTTACTCTC 
CTCTTAGCTA 
TTCAGAACGC 
AGCTGAATAT 
TiGCCCACACC 



AGGTAGCACA 
ACTGTAAAGT 
TGAGCCTTCT 
GGTGAGAAGG 
TTATAAACGG 
AGAAAACACT 
GGCAATGACC 
TGTGAAAATG 
TCCCACCTAC 
GGGACTTCTC 
TGAACAAGTG 
AGGTCCAGTT 
GCTGGGAGAC 
CTGGCGCX3CC 
CACCACATGA 
ACTGAAACTT 
TTCCTTTTTA 
TATTTCCCTA 
AAATTTGGGT 
CGATTACCAC 
AAACATAGAA 
CCACCGGGAT 
CAGATCTGAC 
TGCAAGAGAA 
AGAGGCCACA 
TGTCGATAGG 
ACAACAAAGC 
AAACATGCCC 



ACTTGGTTTG 
AGAATTGAAC 
TTTCCTCCTA 
AAACAACGCA 
AGCCCACCTT 
TACACGAAGC 
GAAACCACAG 
AACTTATTCA 
CTTTGCGGCA 
ATAACATTTC 
TGTGTGTGTG 
TTGATGGTCC 
ACTGCCrCTT 
TTGGGGATCC 
GCCTTTTAAA 
CACAAGCTGA 
AAACACATCT 
TGTGTGTGGA 
GCCATAGCCA 
GGGCCTGCAG 
GGGGAAGTGT 
GGAGAAGACC 
AGCCCTCCTT 
GTTTCCTTAA 
CACCCAGGGG 
TACACGAAGC 
TAAATGTGAG 
AAGATCTTCA 



TTTCACATAA AGAATGGAAA GTCATTAAAA 3720 

TCTGACAGAA CAAGCGAAGT GAGTCTGACT 378 0 

AAGACACAAG CCATACACAG AGTAAAATAA 3840 

GGAGGGCTAG CCAAGTCTGA GAGTCGTGAG 3 90 0 

GCCAGCGAGG TAGTCACATG CTCTGCTAAA 3S60 

AAACATGGGG AAGTGCCATG CAAGCATGTG 4020 

CAGCCACTAG AAAAGGAAGG GTAGTGCGCC 4080 

TTTATTTTCA AAAACGTGTA AGAAGCAAAG 4140 

GGCGAGCACT TCCTGGAATT TATAAAGTGC 4200 

CTACTGCTCA TCTATGTCTG TGTCAAATAG 4260 

TGTCTCTGCG CGCGCACGCG CACTCACTCC 4320 

CGCCAGAGGT ATATTTGAGT ATCATTTCTC 4380 

ACTGGCCTGA AGGTCACTAG CTGATTCATC 4440 

TCCTATCTCT CCTTCCCCAG TGCTGGGATA 4500 

ATGTGAGTTT GGAAGCTCAA ACGCAGGTTT 4560 

ACCGTCTCCC TCTCCTTCCC TCTCTTTTTT 4620 

TGTCTTTAAA AAAAAAAAAA GGCCCAAAAC 4680 

GGGAGGGAGT ATAGGAGGCT GATTTCACTG 4740 

ATCAAAGACG CATCGTTTCC TCTAAGAATT 4800 

GTTCTGGITT GTATTAGAGG AGACACTGTC 4860 

CCAGAATTCT AAATAAGGCT TCGAGAGAAG 4920 

TACCrrCGCC TATCCAGGAT CCATCGTCCC 4380 

GGCTCTTTTG CTGAGGTTTG TrTGAGTTTG 5040 

ACATTCTACC CTGTTCACAA GTAAATACAC 5100 

GAACACCGAT AAAAAGAACA AGCCAGAACC 5160 

AGCCTTCATA CGGAGTTTTC ATTCGTGAGG 5220 

CAGACCAGGC ATGCCTCTGC TAAATGAGGA 5280 

AGTATAATTT TATTATATAG ATTCGCTATG 5340 
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TGTTGACATG TTTTTATAGT GAACCTGGAT TTTACAAACC CTCCTGGTTT GCCACCTGCT 5400 

TCTGGCACCA TACTTGAGGC TTAGGCACGT GATAAAGGAG CATGCCTGTT TCCCCCCTTA 5460 

rrrTTTTTAA AGAAAAGCAC CATGTTACAT CATTAATCAT GCATATCAGT GTAGTTTAGA 5520 

TCCGATGTAG AGACAATAAT CTTATCTCTt"tGTCTGGCTG AAAGACTGTC CTTTAAACTA 5580 

TCATTCTAAA TGCATTTGGT TTTTGCCAGG AGTAAAACAT GTCACAAGAT ATTTGTTGTC 564 0 

ATTTCCCAGG CGTGGAAGGA AAGGAATCGA AAGAAAACCA GGGGTGAAGG CTGCTGTTCC 57 00 

TCTCTAGTCG CTACTTGAAG TCTACATAGC TGGGGGGGGG GGGGGGACTG rrCACATGGG 5760 

ACCGGTTTCC TCTTTGTTCC TACACTGGCG CCTCTGGCAA AAAACTCTCC CTTCTCTTCC 5820 

CCCCAAGCAT ATCTTGGCTG AAAGGTCAGC TCTGAAAAGG GGCCTGGCCA AAGTTACTGT 5880 

AGGGGACCGT GGTCATGGAA CTGGGTAAAC AAAAGCACTC TAGCAGCCAC TGGAAAAGGA 5940 

CCGGGGGCTC TTCTCTGTCC ATTTGCCCTG GAACCCTCAC CACCGCCAGC TCCCTGCATC 6000 

TCCTTGCTAT GaGTTTTCTG GACCGACCCA GCCAGGAAGT TCACAACCGA AATGTCTTCT 6060 

AGQGCTAATC AGGTAACTTC GGACGATTTA AAGTTGCCAG ATGGACGAGA AAACAGTAGA 6120 

GGCQTTGGCA ACCTGGATAA GCGCCTATCT TCTAATTAAA ACATTCAGAC GGGGCGGGGG 6180 

ATGCGGTGGC CAAAGCACCA TAAAACAAAA CTTCCAAGTA CTGACCAACT CACTGCAAOT 6240 

TTGTGCCCCG AGTACATCTA GGTTCAGGGG rrCTTGTCTT CATGCTCCCA ACTGCGGGCG 6300 

GATTTTTGGT CCCTTGGGAC TTTCAGTGCA GCGGCGAAGA GAGITCTGCA CTTGCAGGCT 6360 

CCTAATOAGG GCGCAGTCGG CCTCGTGTTT CTGGTGATGC rrCCCAGGTT GCTGGGGGCA 6420 

GCAAGTCTCT CAGAGCCCAT TACTGGCTAC ATTTTACTTC CACCAGAAAC CGAGCTGCGT 6480 

CCAGATTTGC TCTCAGATGC GACTTGCCGC CCGGCACAGT TCCGGGGTAG TGGGGGAGTG 6540 

GGCGTGGGAA ACCGGGAAAC CCAAACCTGG TATCCAGTGG GGGGCGTGGC CGOACGCAGG 6600 

GAGTCCCCAC CCCTCCCGGT AATCACCCCG CCCCCATTCG CTAGTGTGTA GCCGGCGCTC 6660 

TCTTTCTCCC CTGAGTCCTC AGGACCCCAA GAGAGTAAQC TGTGTTTCCT TAGATCGCGC 6720 

GGACCGCTAC CCGGCAGGAC TGAAAGCCCA GACTGTGTCC CGCAGCCGGG ATAACCTGGC 6780 

TGACCCGATT CCGCGGACAC CGCTGCAGCC GCGGCTGGAG CCAGGGCGCC GGTGCCCCGC 6840 

GCTCTCCCCG GTCTTGCGCT GCGGOaGCGC ATTVCCGCCTC TGTGACTTCT TTGCGGGCCA 6900 

GGGACGGAGA AGGAGTCTCT GCCTGAGAAC TGGGCTCTGT GCCCAGCGCG AGGTGCAGGA 6960 
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TGGAGAGCAA GGCGCTGCTA GCTCTCGCTC TGTGGTTCTG CGTGGAGACC CGAGCCGCCT 
CTGTGGGTAA GAAGCCCACT CTTTAGTAGT AAGGCGGAGA AGTAGGGTGC GGGCGOAGAG 
TGGGAATAGA AGAGGACCTA ACTCGTAGAG CTCTAGAGAC CCTCCTCCCT TGGGTGTTCT 
TTCACTTACC AATGGGGAAA CTGAGGTTCA^AAGACTCTTC CGAAATGACT CAGCCAGGAT 
TCTACTCTCC CCCGGGCATC GGTTGGAGCG TGTCCTGCGG AGCCGTCACA GCCCCTGGCG 
CTAGGTAGGC AGGAGTGGAA AGGCGGCCTG AGCCGGGGCA GGAGATGCTC CCACTGGCAG 
GAACAGGCGG TCAAACGCTG GGAAGCCAGC TCAAGCCAAG CGGCCCGGCT GGCATCAATC 
ACTCGGTGCT GTTGCCCACC GCCCTAGTGG GGGGCAGGGA ATCCGCCTCT GGCTCCGCTC 
CCCTTTAGCT CCAGCGTGTA AGCGCACGGA CTATGTGAGG GTAGGTCTCT TCATAGAGCA 
ACACTTTCCT CCCTCAACTT TCTTTGATGC AGAATGCTAT TTTTGCTGGT AGGAGGAAGA 
CGCGGCTTTC TCTTCTGTX3A CAGCTTCTCC AGGTGTATTA AACTAAATAA CTCTCCACTT 
ACCGACTCCA AAGCGCTGGT CCTGGGGTAA ACTCTGAAAG TCTCAGAAAC TCTTGAGCTT 
GGCACCTAGT TATAGGTCAC TTTTCTTGTT TTAAAATGCC CTCTGCTTCA AGGTTAGGCC 
CACACTCGCT CTTGGGCTTT TGTGCAATAA TTTCCCTrCC CTTCCCTTCC CTTCCCTTCC 
CTTCCCTTCC CTTCCCTTCC CTTCCCTTCC CTTCCCTTTC CCTCTTCCTT TTCCTCCTCC 
TCTTCCTCCT CTATTTCTCT GTCATTTCCT TTTTGAAGCC ACAGTTTGCA GATTTCCZJVT 
CTCCACCCAT TG6AGAATGG AGAATCAGGA AAAAAGAAGT CAATTCTGCA GAAACATTCC 
TTGCGCCCTA AGAGAATCGC ATGGCTTAAA AGCATTGGCA CTGACATACG GCGCCAAGAT 
CGCCTGTCTA GAGCTATTGA GrTTTCCTCA TAATOACTTG GTTCATCAGG CTAGCTCCAC 
CACGAGTGCC CTCTTGTTCC TGAGAAGGCC GCACTCTCCC CCTTTCTGGG AAGAGAAAGA 
CAGCCTGGAA CATGT6CTTG CCCTGGGTTC CATAGAGAAG CAAGTTQCTT TAAAGCCCAG 
AGAATTCCTA GTGTAGCAGC TTAACAGCGT CCCGTTCTCT GAATAAGATG GAGGTTGCCC 
mTGGAGTG TGTGACTTGC TTAATTGGAT TGGGCTATAA TTGGTGCCAT CCAAGTCTCG 
AGACAGAGCC GCTGTTGTTT TTCCTTCTGG TCTTTGAGCG GGAAGGATAA CAGTGCACAA 
ATTAATTAAT GTTGGTTATC GGATTTGAAC ATAAAAGGGC TTTTATTGTA TAGTAGCATA 
TGTACCTCTT GCAGTCAGAA TGAGCTGTCT AAAGAACAGA ACCCAAACTT GCCGATGAAA 
ATGAATGAGG TTTAATAAAG GCGATGGATG AGCATTAGTC ACTGAT6TAA ATCTCCAGTT 
ATTGATAACC TCATTGACTG GATTTGATTG CAGACATGTA TTGGTATGGG GCATCCTTTA 
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AAGATGAGCA 
GAGACAGTTG 
TCTGCGCTCG 
GCAGTTTGTG 
TCAGATGAGC 
TACTTATCAA 
TTATTGAAAG 
ATTTCATATT 
TCTCAGGGCA 
AATTTGTTGG 
CTTTAACAAT 
AAAATATGTC 
AGTTGGGTAT 
GTGGAGAACT 
CCTTGGCTCr 
TGCCCAACAA 
ATTAGAAAGC 
TCTCACTGTG 
TGCTGGGCTT 
AAGATCCTGA 
TATCTCGCTG 
AATCAGTTAG 
TGATTAATGA 
GCTCTGAGTT 
TCCTTTTCCT 
AGATACCAAA 
GCTCCCGTCC 



TAGCCAACGT GCCTGCACTC TAAOAGAATC TATGGCTQTA TGTTATTACA 
AGAAGCTCTT AGTGGCTCTG GCGTGTAGAT CAGCGGTAGA GCGCTGAGGC 
CTTCCTGGCA CTGAAGAATA AAGGCCAITT ACTGTGGTGG TGCAGTGGGC 
ACGAGTTACT ACTACATTTT'^CTCACACAT CTGCCTGACT AATGAGTTCA 
GTATCCAGTG ATTXSTITGCA GGTTAATGGT TCTCAGTCAT GTTTAGAATC 
ACAAATTGTT TTCTCATTTC CTGCTTCTTC TCAAACAAAG TAAGATTCCA 
GCTTGTTTAA GAGCATTTTA ACTGCTTGCC TATGTTAGGG ACAGTGACTT 
GACAAATATT ATGCCGATTA ATTGAATATG ACTACCCAGT TCTATAGCTG 
GACCAAGAGC ATCTGTGATC CAGTCACTTT AAATGCCATT TAAAATGCAT 
TCTAGGAATA AACACACTGT AAAGTTTAGA ATCACGGCCC AAACACAAGT 
GCCAACTAGC TTCTGAGATT CATTAATGTC ATTTAATTAC CAATGTTTTA 
ATTAATTACT AAATCTATAG TTGTAACAGC AACACATX5TA CATCTTATTA 
ATTCAGGGTC GCATAGCTGT AGACTATTGC ACATCTGTGT TGGTGAGCCA 
GCCTCCTGGC TCTTCTCAGA AGGCCACAGT GTCACGGCAT TGGCTATTTG 
TTGCTAATAC TTTATTGACA TGGCCTCATC TTCGTTCACG TTCACTTAIT 
CGTCAATGCC AGCTCAGGCC TTAGGAGTCA TCTGTTCTTA GTCAGTGCGA 
CTGGATGCCT GCCTGCTATT AATTAGrEAT TCTTCTCTTC TGAGACAGAG 
TGGCCCAGGC TAGTCTCAAA CTTGCGGTCC ATTTGTCTCA CTCATCAGAA 
CCAGGTCTGT GCACCACACT AGGTAGCTCG CGTTTTAAGC TAAGAGCTGG 
TGTCCTTTAC CATGGTGGGC ATGTTACAGG TTAGTTGACT GAAAACTAGT 
TCTAATGACC TCCAGTGGTA TGTATCTCTC AAGATGCTTT TTTGCATTTC 
GTAACAAGTT CTTAAGTCTC CAGCTTGGTA TTGGCATGAG CTCAGAGCTT 
GTTGGGACCC CCTAGCTATT GCTCATTAGA CTTACACTAT TTTTAGTTTT 
TATCAATATG CATGTATGCA TGAACTTGGG AGATATTTTT CTTCCCCAAT 
CCATTTAAAT GTGCTGTCTr TAGAAGCCAC TGCCTCAGCT TCTGCAGCTC 
GGAAGTCTGG TACACAGCAT GATAAAAGAC AATGGGACGG GGTCACAGTG 
CTTTCAGGGG TATGGAGACG AGCTGTAGAG AGATGTCTCC AGGGAGTTTT 



8700 
8760 
8820 
.8860 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 



vO089855638Ihttj3^7/y/ww.^ ^5 of 1 



WO 98/55638 PCT/EP98A)3318 

63 



CATTAATCAG CAATTTAGTC AGATCTGTGC ATCCTATGCT TTACAAGAAA TGTCAGTQGG 10320 

CCTGAGATCA TCAGATGGAG GTTCATCGGG TTTCAATGTC CCGTATCCTT TTGTAAGACC 103 80 

TTGAAGTIGG CAACGCAGGA AAACAGGAAC TCCACCCTGG TGCCGTGAAT TGCAGAGCTG 10440 

TTGTGTTGGT TTGTGACCAT CTGCCCATTC JPTCCTGTTAT GACAGAGC1T GTGAACTTTA 10500 

ACTGGGACTG GGGCAAAGTC AATCCCACCT TTATACAATG AATTGCTGAA GAGGCCTTTT 10560 

AAAACTTGGA GTGTGCATTG TTTATGGAAG GGCTTTCCTA TTGGATCCAA CTCTTTTCTA 10620 

ATTTGTTTCT AGGTTTGCCT GGCGATTTTC TCCATCCCCC CAAGCTCAGC ACACAGAAAG 1068 0 

ACATACTCAC AATTTTGGCA AATACAACCC TTCAGATTAC TTGCAGGTAA GGATTCCTTT 10740 

TTGAGCCAGC TTTCCTATGT GAAAGGACTC ArTGTTTACT GAGGTCACAA CAATTTCCAC 10800 

TATTGCAGAA GTATAATAGT ATTCTTAC/A TTGTTTATAA ATCATGAGAC TTCTAAGAAC 10860 

CTATTTAATA ATGAAACAAT GGAAAAAGTC TTTTCAAACC TTTGTACTCT TTTGCTGAGC 10920 

CGTTTTCAAC ATGCACAAAC ATATTACACA AATATAACAT ACACAGGAAC ACACATtSAAT 10980 

GCATCGGATG ATGTGCCTAA AACTAGCATG TAATTGATAT TCACAATTAT TGATAAATTA 11040 

GTAAAGCAAA GGAATTCCTT ATGAATAGAG CTAAAATTCT ATCCATGrTC AAGTCACCCA 11100 

GAATGGCTTC TGGACATTTT TTTTTTrAGC TGTTTTCTAC AAGTGAAATT CTGCCTGTAT 11160 

TAGCAATTTA ATATCTAGCC AATAATATTC CTGACCATAT GTCX:TGrrCA GACCATGACC 11220 

rrCATAATCT GGCTTCATCT TCTGGGCTTC TTTCCCTCTT GCCAGCAAGA TGTCACGGTG 11280 

TTGATGCTGG ATAAACTGAG AAACAGAAGT TTTTCGCAAG AAGAGGACCT TGAATTTTGC 11340 

TTTTCCCCTG AGAGACAAGA AAGGAAACTT AGAGGAGGTG TAGCTGGGAG TGTGCTCATT 11400 

CATGAAAGAC CTGTTTGCAG GGCAGTGTGT TTTGCTGGGG ACAGTAATGA GCCTAGATCG 11460 

TAGTGCCATC CCAAGAGAGT GCTTX3GTGGC AAAAAGAGCC CTAGCAGCTT GTGGCAGTTG 11520 

CCTCATATTT GAAGAATACT AAGAGGTCCC CCGAATAACT CAGGGCTAGT GTTGATCATT 11S80 

GCATGTCGAG AGAATCCAAG CCTCCTATCT AGGGTCTT^CA AAAGTAACCA ATGCCCAGTC 11640 

TTTGGGGGAA AGCAAAACCA GAAAGCGATG ATAGCAGGAC CTGTrTATTT TCATTAAGTC lltOt 

ATGGCATTTC CAGAGACTTT GCTCCCCCTA TTCTCAGACA CAAAGCCCAC TTAAGATCTC 11760 

CCTCTCGAGA CTGCTGGGAA CATTTCTTAA GTTCTGAAAA AACCCTGGAG TGArTGGGCA 11820 

CAGACGATCC TGTCACTTCA TGTGAGTGCT AAGCTCnTG GGTGATGACT CAGTGGGTCA 11880 

CATTGTTTTA TTCATATTGA CTACCTTCCG TTTGCrTTGC GGAGAATGGA AGCTATAGAA 11940 
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GTCTGTTTGG 

TTACTCTCCC 

TCAGATGTGC 

TCCTCCCAAT 

TATTTTCCTT 

GTCTGTCTCC 

TGTACACTGG 

CTCTGTCCTT 

GGAATGAATG 

AGAGTGTGAG 

TTGAAGGTGA 

CTCATAGCCT 

AATACTTTTT 

CTGAGGAAAG 

TCACCATTCC 

TCGAC 



TGTGGCCCTC 
TTGCTTATAC 
CCTGGGTCCT 
GTGCATAGGG 
CACAGTATCA 
TTATATTAGT 
GCAACTACTC 
AATAATGCTT 
AATGAATGAG 
ATCTTTGGTG 
CTATTGAGTA 
CTCTTAATCA 
CAGGGGACAG 
GGTATTGGTG 
CAGGGTGGTT 



ACAAGGCACT 
CCACATAGGG 
GTCTGTCCTT 
CCAGCTCGCif 
AATCACCACA 
GATAGGTTCC 
TACCACTGAG 
AGCAATCATA 
CTAACAGAAA 
GCATGTCCTT 
GITTATATGC 
TAGCCCTGTG 
CGGGACCTGG 
ACTGAATGCG 
GGAAATGATA 



GTOAGCTTCT 
ACTCTGGCTT 
CACACTTACT 
"CCAAATCCTC 
GTTTATGCAA 
AGAAAGGCAC 
CGTGATATCC 
TTTGCTTAAA 
ACTCATGACC 
GTAGACTGTC 
ATGTGAAAAA 
GCATGGAGTG 
ACTGGCTTTG 
GCGGTGGTGA 
CTGGAGCCTA 



TCTCTCTGTG 
TGTTGCTGTT 
GATGCTGCCT 
TCTTTTCTTT 
CAAACTGAAA 
TGATTTTTTT 
TTGGTCCCTT 
ATATTTATTG 
ATGTGGGTGA 
TGCCACCAGT 
CCAAACCTTC 
TACCATTGAT 
GCCCAATGCT 
CAGTATCTTC 
CAAGTGCTCG 



TGCTAACTTC 

CTTCAATGCT 

GGAATGCTAT 

GCCTCTTTTA 

CTTTAAAATT 

TCTTCCCTGG 

AAAAGTTATC 

AATGACTGCA 

TTTCCGAAAC 

ATCTATCATC 

TATTCTCTTA 

ATCTTCCTGG 

CAGCGTGATT 

TGCAAAACAC 

TACCGGGACG 



(2) INFORMATION FOR SEQ ID NOj 2: 

(1) SEQXJKNCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc ■ "oligonucleotide*' 

(iii) HYPOTHETICAL: YES 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2; 
GGGGTACCGA ATTCTAAATG GGGCGATTAC C 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = »• oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GTGGTACCCA AACACTCAAC ACCACTG 27 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "oligonucleotide" 

(iii) HYPOTHETICAL; YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
TCGGTACCGA CCCAGCCAGG AAGTTC 

(2) INFORMATION FOR SEQ ID NO: 5: 



26 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid ^ 

(C) STRANDEDNESS J single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc » "oligonucleotide" 



(iii) HYPOTHETICAL: YES 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TTGCTAAGCT TCCTGCACCT CGCGCTGGG 29 



(2) INFORMATION FOR SEQ ID NOi feT 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 6: 
AGGGATCCAC TCTTTAGTAG TAAGGCG 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ACCTCGAGAC TTGGATGGCA C ^1 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGGCTATAAT TGGTGCCATC C 



(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide* 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGATGGAGAA AATCGCCAGG C 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEHCTTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc - "oligonucleotide" 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTGTGCATTG TTTATGGAAG GG 
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(2) INFORMATION FOR SEQ ID NO: 11: 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear ^. 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CATAGACATA AACAGTGGAG GC 22 

(2) INFORMATION FOR SEQ ID NO: 12; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: linear 

....(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

ATGGTACCCA GGTTGCTGGG GGCAG 25 



(2) INFORMATION FOR SEQ ID NO; 13 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEKGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

- . . . (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc •« "oligonucleotide" 



(iii) HYPOTHETICAL: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TGGTGCCGGA AACCAGGCAA A 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

. . , . (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 14 1 

ATCCTCTGCA TGGTCAGGTC 



(2) INFORMATION FOR SEQ ID NO: 15: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

. . . , (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "oligonucleotide*' 

(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 15: 

CGTGGCCTGA TTCATTCC 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc « "oligonucleotide*" 

(iii) HYPOTHETICAL: YES 

(Xi) SEQOEHCE DESCRIPTION: SEg ID NO: 16: 
GGGAATTCAC CATGAGTTCT GAACGTCGAA AAG 3 3 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc ^ "oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AAGCGGCCGC TCATTTATCG TCATCGTCCT TGTAATCGTT AACTTGATCC AAAGCTCTG 59 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID WO: 18: 
GGGAATTCAC CACAATGACA (SCTGACAAGG AG 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

<iii) HYPOTHETICAL; YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AAGCGGCCGC TCATTTATCG TCATCGTCCT TGTAATCGTT GGTGGCCTGG TCCAGAGCTC 60 
TGAG 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide « 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CCGGTACCCA AACCCCGCCC AGCGTCTTG 29 



(2) INFORMATION FOR SEQ ID NO; 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc ^ "oligonucleotide** 
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(iii) HYPOTHETICAL J YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CCGACAAGCT TCaTCGCTCG GTGTTCGAGG 
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Claims 

1 . A recombinant DNA molecule comprising: 

(a) at least one first regulatory sequence of an intron of the Vascular 
Endothelial Growth factor (VEGF) receptor-2 (Flk-1) gene or of a gene 
homologous to the Flk-1 gene being capable of confening expression in 
endothelial cells in vivo; and 

(b) operatively linked thereto a heterologous DNA sequence, 

2. The recombinant DNA molecule of claim 1, wherein said first regulatory 
sequence comprises a GATA-binding site, an AP-1 binding site, an SP1 
binding site, an NFkB binding site, a STAT binding site, a Scl/Tal-1 binding site, 
an Ets-1 binding site, a PEAS consensus sequence or any combination(s) 
thereof. 



3. The recombinant DNA molecule of claim 1 or 2, wherein said first regulatory 
sequence is selected from the group consisting of 

(a) DNA sequences comprising a nucleotide sequence as given in SEQ ID 
NO:1; 

(b) DNA sequences comprising a nucleotide sequence of SEQ ID NO: 1 from 
nucleotkle 8260 to nucleotide 10560, from nucleotWe 8336 to nucleotide 
10608 and/or from nucleotide 10094 to nucleotide 10608; 

(c) DNA sequences comprising the nucleotkle sequence Of the human Flk-1 - 
intron; 

(d) DNA sequences comprising a nucleotide sequence which hybridizes with a 
nucleotide sequence of (a)» (b) or (c) under stringent conditions; 

(e) DNA sequences comprising a nucleotide sequence which is conserved in 
the nucleotide sequences of (a), (b) and (c); and 
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(f) DNA sequences comprising a fragment, analogue or derivative of a 
nucleotide sequence of any one of (a) to (e) capable of conferring 
expression in endothelial cells. 

4. The recombinant DNA molecule of any one of claims 1 to 3. wherein said 
heterologous DNA sequence is operath/ely linked to further regulatory 
sequences. 

5. The recombinant DNA molecule of claim 4, wherein said further regulatory 
sequence is a promoter. 

6. The recombinant DNA molecule of claim 4 or 5, wherein said further regulatory 
sequence is a 3 -untranslated region. 

7. The recombinant DNA molecule of claim 5 or 6, wherein said promoter is a 
promoter of hypoxia inducible genes, genes encoding growth factors or its 
receptors or glycolytic enzymes. 

8. The recombinant DNA molecule of claim 7, wherein said growth factor is 
VEGF, PDGF or Fibroblast growth factor, 

9. The recombinant DNA molecule of any one of claims 5 to 8, wherein said 
promoter comprises a DNA sequence selected from the group consisting of 

(a) DNA sequences comprising the nucleotide sequence as given In SEQ ID 
N0:1 from nucleotide 6036 to nucleotide 6959; 

(b) DNA sequences comprising the nucleotide sequence of the human Flk- 
1/KDR promoter, 

(c) DNA sequences comprising a nucleotkle sequence which hybridizes with a 
nucleotide sequence of (a) or (b) under stringent conditions; 

(d) DNA sequences comprising a nudeotWe sequ nee which Is cons rved in 
the nucleotide sequences of (a) and (b); and 
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(e) DNA sequences comprising a fragment, analogue or derivative of a 
nucleotide sequence of any one of (a) to (d). 

10. The recombinant DNA moiecule of any one of claims 1 to 9, wherein at least 
one of said DNA sequences is of human or murine origin. 

11. The recombinant DNA molecule of any one of claims 1 to 10. wherein said 
heterologous DNA sequence being operatively linked to said regulatory 
sequences is located 5' to said first regulatory sequence. 

12. The recombinant DNA molecule of any one of claims 1 to 11. wherein said 
heterologous DNA sequence encodes a peptide, protein, antisense RNA. 
sense RNA and/or ribo^me. 

13. The recombinant DNA molecule of claim 12, wherein said protein is selected 
from the group consisting of Vascular Endothelial Growth Factor (VEGF), 
Hypoxia Inducible Factors 7(HIF). HIF-Related Factor (HRF), tissue 
plasminogen activator. p21 cell cycle inhibitor, nitric oxide synthase, interferon- 
y, atrial natriuretic polypeptide and monocyte chemotactic proteins. 

14. The recombinant DNA molecule of claim 12. wherein said protein is a scorable 
mariner, preferably luciferase, green fluorescent protein or lacZ. 

15. The recombinant DNA molecule of daim 12, wherein said antisense RNA or 
said ribozyme are directed against a gene Involved In vasculogenesis and/or 
angiogenesis and/or tumors of endothelial origin. 

16. A nucleic acid molecule of at least 15 nucleotides In length hybridizing 
specifically with the first regulatory sequence of a recombinant DNA molecule 
of any one of claims 1 to 15. 
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17. A vector comprising a recombinant DNA molecule of any one of claims 1 to 1 5. 

18. The vector of claim 17, which is an expression vector and/or a targeting vector 

19. The vector of claim 17 or 18, further comprising a gene capable of expressing 
H[F-2a. 

20. A cell transfonmed with a DNA molecule of any one of claims 1 to 15 or the 
vector of any one of claims 17 to 19. 

21 . The cell of claim 20, which is a prokaryotic or eui<aryotic cell. 

22. The cell of claim 20 or 21 . which is an endothelial cell. 

23. The cell of any one of claims 20 to 22. further comprising a recombinant DNA 
molecule or vector containing a gene capable of expressing HIF-2a. 

24. A pharmaceutical compositton comprising a recombinant DtiA molecule of any 
one of claims 1 to 15, the vector of any one of claims 17 to 19 and/or the 
nucleic acid molecule of claim 16 and optionally a pharmaceutlcally acceptable 
earner. 

25. A diagnostic composition comprising a recombinant DNA molecule of any one 
of claims 1 to 1 5, the vector of any one of claims 1 7 to 1 9, the cell of any one of 
claims 20 to 23 and/or the nucleic acid molecule of claim 16, and optionally 
suitable means for detection. 

26. A method for ttie production of a transgenic non-human animal, comprising 
introduction of a recombinant DNA molecule of any one of claims 1 to 15 or a 
vector of any one of claims 17 to 19 into a gerni cell, an embryonic cell or an 
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27. A transgenic non-human animal comprising stably integrated into its genome a 
recombinant DNA molecule of any one of claims 1 to 15 and/or the vector of 
any one of claims 17 to 19 or obtained according to the method of claim 26. 

28. The method of claim 26 or the transgenic non-human animal of claim 27. 
wherein said animal is a nrause. 

29. A method for the identification of a chemical and/or biological substance 
capable of suppressing the transcription of a gene in endothelial cells 
comprising: 

(a) contacting a cell of any one of claims 20 to 23 or the transgenic non-human 
animal of claim 27 or 28 either of which is capable of expressing the 
heterologous DNA sequence with a plurality of compounds; and 

(b) determining those compounds which suppress the expression of said 
heterologous DNA sequence. 

30. A method for the identification of a chemical and/or biological substance 
capable of activating and/or enhancing the transcription of a gene in 
endothelial cells comprising; 

(a) contacting a cell of any one of claims 20 to 23 or the transgenic non-human 
animal of claim 27 or 28 either of which is capable of expressing the 
heterologous DNA sequence with a plurality of compounds; and 

(b) detemiining those compounds which are capable of activating and/or 
enhancing the expression of said heterologous DNA sequence. 

31 . Use of a recombinant DNA molecule of any one of claims 1 to 1 5, the vector of 
any one of claims 17 to 19, the cell of any one of claims 20 to 23, the 
pharmaceutical composition of claim 24, the diagnostic composition of claim 25 
and/or the transgenic non-human animal of claim 27 or 28 for the Identification 
of a chemical and/or btological substance capable of suppressing or activating 
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and/or enhancing the transcription, expression and/or activity of genes and/or 
its expression products in endothelial celis. 

32. The method of claim 29 or 30.or the use of claim 31, wherein Oie chemical 
and/or biological substance is selected from the group consisting of peptides, 
proteins, nucleic acids, antibodies, small organic compounds, hormones, 
neurotransmitters, peptldomimics and PNAs. 

33. A method for the production of a phamaceutical composition comprising ttie 
steps of the method of claim 29 or 30 and (c) fomiulating the compound 
identified In step (b) in a pharmaceutically acceptable fonn. 

34. A method of inhibiting a vascular disease in a subject, comprising contacting 
an artery of said mammal with the vector of any one of claims 1 7 to 1 9. wherein 
said heterologous DNA sequence encodes a protein that reduces or prevents 
the development of the vascular disease. 

35. The method of claim 34. wherein said protein reduces proliferation of smooth 
muscle cells. 

36. Use of a recombinant DNA molecule of any one of claims 1 to 1 5. the vector of 
any one of claims 17 to 19, the nucleic acid molecule of claim 16 and/or a 
substance Identified by the method of claims 29. 30 or 32 for the preparation of 
a composition for directing or preventing expression of genes specifically In 
endothelial cells. 

37. Use of a recombinant DNA molecule of any one of claims 1 to 1 5. the vector of 
any one of claims 17 to 19, the nucleic add molecule of claim 16 and/or a 
substance identified by the metiiod of claims 29. 30 or 32 for the preparation of 
a pharmaceutical composition for treating, preventing and/or delaying a 
vascular disease and/or a tumorous disease in a subject 
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38. Use of a recombinant DNA molecule of any one of claims 1 to 15, tiie vector of 
any one of claims 17 to 19 and/or tiie nucleic acid molecule of claim 16 for the 
preparation of a phannaceutical composition for Inducing a vascular disease in 
a non-human animal or in the transgenic non-human animal of claim 27 or 28. 

39. The method of claim 34 or 35 or the use of any one of claims 36 to 38, wherein 
the vascular disease is atherosclerosis and/or a neuronal disorder. 



40. 



Use of a regulatory sequence as defined in any one of claims 1 to 3 for 
enhancing and/or directing gene expression in endothelial cells. 
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-5660 TCTAGAAT^iT .-.r-.ViGATA.\G T7TGCGTACA ATTCAGTCC7 TTGAAGACCT 
GATAAGCTTT A.-.GAAGGAAG ATGGGTTACA CATTGGGAAA TGGTTGCAAT 
CTGCACATGG CAGAGGCAAG AGATGCAAAT CACATTTCTT ACATACTCCA 
-6510 TACAAATCTT ACAAGACTGT TTTTGTTTCT CATTTAAAAT AAGAAGACCT 
GCCAGTCTTC CCCTTATTAC TAATTACAGT CACTCTGTAT CTTTGTTGAC 
ATTGGATAGT TTTACATACT TCAACAGGCT GGTGTCATTA AAGTTGTGGT 
GGGTGGGCAC CAGAGACACG TGATTCAGAG TGGGAGGAGA TGCAGGAGAA 
ACGAGGCACA GCAGAAGCAG AAGCGAGGAA AAACACTCTC AACGTTACTA 
ACACATCGAG AGGTTCCGCA CACTAGCAAT ACGGGCTGAA TCTGACCTAA 
TCTCTGCTGT TGAAAATTTT GCCTAGCCGC ACACTAGCAA TACGGGCTGA 
ATCTGACCTA ATCTCTGCTG TTGAAAATTT TGCCTAGCCT GTCACACAAG 
TGCTGAGCAT ACAGAAAAAG GAGAGTAATT CTCTGGTTCT TTGACTAACC 
AAATAGTCTA TATCAAATTG CCTAAGATAA TGTATACATT TAGTACATGA 
-6010 CTGGTTATAC CTATTCTATA TGACTATTAT TTAAATGTGA ATTTACAAGT 
6AGCATATGA AGTCCATTTT ACATGGCTAG TACATATAAC TTTTAAAAAG 
TTGGACATAG TTATATTTTT CCATTTATTT ATTTACTTTA TATCCTGATC 
ACAGACCCCC CCCTCCTCTG GATTAACTCT CTCCACTGCT TCTTACCCCT 
CCCCATCTCr CCTTCACCTC TGAGAAGGGG GGATACCTCC TGTCTTATCT 
GGTTTCAGTG GGAGAAGGAT GTATCCTAAC ACATATAATT TTTAATATCC 
TGAGTTTTTC TTTCATACAC CTTACTTATT CTATTCATTT TTCAGGAAGG 
CATGTTTAAT GT7TTTTTTT TAATTTTATG TGTACGAGTG TTTTGCCTAC 
ACAGTCATAG TGCATCGCA7 ACATTTTTGC TGCCCGTAGA GATCAGAAGG 
GAGCATTGGG TTCCCTAGGA CTGGAGGCAT GAACCACCTT GTGGGTGCAG 
AGAACTGAGC CTGGG7CATC TCAAAGCATC AGGTTCT7CT TGAGTCATCT 
CACTTGCCAC TTCtCCCATT TACTGATTTT ATCTGTGTGC AGACATTCAT 
GGCCCAGTCC ACAGGTGGAA GTCAGGGACA ACCTATAGGA GTCACTCCTC 
TCCTTCTACC GTGTGAGTCC CTGGCCTCAA ACTCAGGTTG TCGGGCTTCA 



Figure 1 
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TAGG^GAGC TTCTArTTGT TGAGCCATCT TGCTAGCCCC ACCCCATACT 
ATCTTTATAA TATCTG??TA AITAAGACAT TCATAATGAA TTTTATTAAC 
ATTCATCGTT ATCCCCT7TA CCAATTTTAC TATGTATTAA TTGCCACCCC 
TTTAAATTTA ATTACfTCCT TGGCTGGGTT TTACAGGAGA GTTCCAGGAA 
GCTAGATGGA GAGATGGCTC AACAGTTTAG AGCAACGGCT G TT C TTGCAG 
AGGACCTAGG TTCAAGTCCT GGCACTCAaA GGTGGCTCAC AATCATCTGT 
-5010 GACTTCAGTT CCAGGGGATC TGAAGAATTC TTCTGGGCTC CATGGGCATC 
AACTACACAC TTGGTTCATA GACATACATG CCAGCAAATG ATTGATCCAT 
ACATATGAAA TAAACCATAA ACAGAAAAAA AAAAGGAAGG TGAGGGAAGG 
AAAAAAAGTT TAAAAAAAGG AAAGGAAGGA AGGAAGGGA>} ^I^7^}^7HNKNNN 
NNNNtlNNNNN NNNNSNNNNN KNNNNNNNNN NNKNTCTCTC CATACTGAAA 
GATGTCCACA ATGACTAAGG GAATTTTTTT TAAAAGACAA GCACAACGTT 
TTCTAGGGAT CAAACTCTAT TTGTGAGGAA GACTGGTGGT TTGAAGATTA 
CATAGCAGAG TTACATCTAA CATGAGCGTG TTTCCCCTGG ATGGAAGGAG 
TCTGATAACT TGTCTTTCTT TCTTAGTTAG CATCTCAGAG TCCCCCGCCT 
CCCTTAACAT CCTTTTTGCA CACCATCTTT TTAGGAAAAT GGATCATTTA 
TGGGGA7GTA GTGATTTGTA CAAGAATGTC CCCTGTGGGC TCAGATATTT 
GAATACTTAG TTCCCAGTTG GGGGAGCTTT TGTAGGGAGG TTGGGAGGCA 
CAGCCTGGCA GGAGGAAGCA TGCTAGCAGC TTTGAGACTA TAAACCCTCA 
TCTACTACCT TGTTCTCTTT CTGCATTGTG CTGTGTCTGA CACTGTGAGA 
TTCCTGCTCC CGATGCCATG CCTGCCCGCC ATGATAGACT CCTAGCCCTC 
TGGAAAGGTA ACCTCAGTGA ACTCTCTTCT ATAAGTTTCT TTGCTCCTGG 

Hindirr (-4200) 
TGTTTTATCA CTGAAACGGA AAAGCTTGCA GGGAGGTAGG AGGCAGCCTG 



Figure 1 continued 
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astiii c-4100) 

TGGCG7TGA7 TCAATGCACC TGGCCTTATC CTCGGAr3?.G ATC GGTCACC 
AGTCAAAAAC T3TGAGCTTG AAGGTCTTGG GTGCTTAACA TCTATTTTTA 
CAAATCTTAT TTAGCAACTT AGAACTGTGA AATAT70GAA AGCTACTTAA 
-4010 ACCTTCTAAA CTCCCTCCtC CACACTATGA GAATGTTACA TTTTCTATTC - 
AGTTATTTTT GAGCAGTAAA CAGATGAATC AAGGAATATG CCCATCACAT 
CAAGAGTGCT CCTAAATGGA CTTGCTTGTT ATTCATTTAC AGTGTGGCCC 
CTTGACTTTC ATCGGCACTC CTAGCAGAAA ACAAAATCCG CCAGATGGAG 
CTGGAGAGAT GGCTCAGCTG TTAAGAATAC TTATCCCTAC ACAGGCCCTG 
GAGCCAGTTC CCAGCACCCA CACGGTGGCT CACAACCATC TGTAACTCCA 
GTTCTAGGAG ACCCGACTCC CTCTTCTGTC TGAAAACACC AGGCACGCG7 
GCGGTCTACA TACAAACATG AAAGCAAAAT ACACACATTA CATAAATAAA 
TCTTAAAAAA TGATTCGGGG TGGGGGAAGG AAAAAAAAGG ATGTTAGAAA 
ATCGATGTAA CTGTTTTTTC CTTTTGCACA GATCTAAGTT AGGGAAGGAG 
AACATTCTCT TACCATCGAA AATAATTGTT TTCATTGCCC CCAAGTCTGC 
TAATAGAGC7 TGCTACCTTC ATGGCTGTCG TAAGGATGAG GCAAAGATGG 
ACTTCAGC77 TCAGACTGTG TCTGCTCAAA TGTTGGCTAC TCCTGTTTTC 
TGACCCCC77 CTCTGGTGCA A7GTGGACTT TCAATTAATT TCCCTGCATC 
TTTTACATAT TTGATTTAAA AAATATTTTA TTTTATGTAA TTGTATGTAT 
ATGCATGTCA ATAAGCATAT GTGTGTGTGT TTCCATGGAA ACCAAGGCAA 
CAGATTTTCC AGAGCTGTAG AAATGGGCTG TGAGACGCCC ACTGTGGGTG 
TTCGGAACCA AACTCGGGTC CTGTGGAAAG ACAGCGAGCA CCCATAATGC 
AGAGGTATC7 CTCAGATTTT ACTTTAAAAT TTCAATTTTC TTTTTTTTTT 
TTAAAGTTCC AAGTAACTAT AGGAAAGTAC ATGGGTATAT AGATCCCCAG 
-3010 TACCAAGA77 C7TCCTTTGC AGGTAGCACA ACTTGGT77G TTTCACATAA 
AGAATGGAAA G7CATTAAAA CACTCATCAC ACTGTAAAG7 AGAATTGAAC 
TCTGACAGAA CAAGCGAAGT GAGTCTGACT TCCAGGTAAC TGAGCCTTC7 



Figure 1 c ntinu d 
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TTTCCTCCTA P.^^GACACAAG CCATACACAG A3T.\AAAT;iA ACTTGGGCAT 
GGTSAGAAGG .^Ai^.CAACGCA GGAGGGCTAG CCAAGTCTGA GAGTCGTGAG 
TGTGCTCGGT TTATAAACGG AGCCC\CCTT GCCAGCGAGG TAGTCACATG 
CTCTGCTAAA CAGAAACTTA AGAAAACACT TACACaiiAGC AAACATGGGG 
AAGTGCCATG CAAGCATGTG ACTGACTGGT GGCAATGACC GAAACCACAG 
CAGCCACTAG AAAAGGAAGG GTAGTGCGCC ACACTGTAGT TGTGAAAATG 
AACTTATTCA TTTATTTTGA AAAACGTGTA AGAAGCAAAG ATGTCTTCTT 
TCCCACCTAC CTTTGCGGCA GGCGAGCACT TCCTGGAATT TATAAAGTGC 
GATCTTTCTG GGGACTTCTC ATAACATTTC CTACTGCTCA TCTATGTCTG 
TGTCAAATAG AGAATGCTCT TGAACAAGTG TGTGTGTGTG TGTGTGTGCG 
CGCGCACGCG CACTCACTCC TGCTCTGTTG AGGTCCAGTT TTGATGGTCC 
CGCCAGAGGT ATATTTGAGT ATCATTTCTC AAGAGCTTCA GCTGGGAGAC 
ACTGCCTCTT ACTGGCCTGA AGGTCACTAG CTGATTCATC TCCGTTTGGG 
CTGGCGCGCC TTGGGGATCC TCCTA.TCTCT CCTTCCCCAG TGCTGGGATA 
ACAAGGTTGG CACCACATGA GCCTTTTAAA ATGTGAGTTT GGAAGCTCAA 
ACGCAGGTTT TCATGCTTGC ACTGAAACTT CACAAGCTGA ACCGTCTCCC 
TCTCCTTCCC TCTCTTTTTT CCTTTTCTTC TTCCTTTTTA AAACAOVTCT 
-2010 TGTCTTTAAA AJ^AAAAAAAA GGCCCAAAAC AJVGTGTAAAG TATTTCCCTA 
TGTGTGTGGA GGGAGGGAGT ATAGGAGGCT GATTTCACTG AGATCCTGTT 
AAATTTGGGT GCCATAGCCA ATCAAAGACG CATCGTTTCC TCTAAGAATT 
CTAAATGGGG CGATTACCAC GGGCCTGCAG GTTCTGGTTT GTATTAGAGG 
AGACACTGTC TTCTTAAGTA AAACATAGAA GGGGAAGTGT • CCAGAATTGT 
AAATAAGGCT TCGAGAGAAG CCTTGTCTGG CCACCGGGAT GGAGAAGACC 
TACCTTCGCC TATCCAGGAT CCATCGTCCC TCCCTCTACC CAGATCTGAC 
AGCCCTCCTT GGCrCTTTTG CTGAGGTTTG TTTGAGTTTG TTTTACTCTC 
TGCAAGAGAA GTTTCCTTAA ACATTCTACC CTGTTCACAA GTAAATACAC 
CTCTTAGCTA AGA3GCCACA CACCCAGGGG GAACACCGAT AAAAAGAACA 

Figure 1 c nttnued 
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AGCCAGAACC TTCAGAACGC 7GTCGATAGG TACACCi^AGC AGCCTTCATA 
CGGAGTTTTC ArTCGTGAGG AGCTGAATAT ACAACAAAGC TAAATGTGAG 
CAGACCAGGC ATGCCTCTGC TAAATGAGGA TGCCCACACC AAACATGCCC 
AAGATCTTCA AGTATAATTT TATTftTATAG ATTCGCTATG TGTTGACATG 
TTTTTATAGT GAACCTGGAT TTTACAAACC CTCCTGGTTT GCCACCTGCT 
TCTGGCACCA TACTTGAGGC TTAGGCACGT GATAAAGGAG CATGCCTGTT 
TCCCCCCTTA TTTTTTTTAA AGAAAAGCAC CATGTTACAT CATTAATCAT 
GCATATCAGT GTAGTTTAGA TCCGATGTAG AGACAATAAT CTTATCTCTT 
TGTCTGGCTG AAAGACTGTC CTTTAAACTA TCATTCTAAA TGCATTTGGT 
TTTTGCCAGG AGTAAAACAT GTCACAAGAT ATTTGTTGTC ATTTCCCAGG 
-1010 CGTGGAAGGA AAGGAATGGA AAGAAAACCA GGGGTGAAGG CTGCTGTTCC 
TCTCTAGTCG CTACTTGAAG TCTACATAGC TGGGGGGGGG GGGGGaACTG 
TTCACATGGG ACCGGTTTCC TCTTTGTTCC TACACTGGCG CCTCTGGCAA 
AAAACTCTCC CTTCTCTTCC CCCCAAGCAT ATCTTGGCTG AAAGGTCAGC 
TCTGAAAAGG GGCCTGGCCA AAGTTACTGT AGGGGACCGT GGTCATGGAA 
CTGGGTAAAC AAAAGCACTC TAGCAGCCAC TGGRAAAGGA CCGGGGGCTC 
TTCTCTGTGC ATTTGCCCTG GAACCCTGAC CACCGCCAGC TCCCTGCATC 
TCCTTGCTAT GGGTTTTCTG GACCGACCCA GCCAGGAAGT TCACAACCGA 
AATGTCTTCT AGGGCTAATC AGGTAACTTC GGACGATTTA AAGTTGCCAG 
ATGGACGAGA AAACAGTAGA GGCGTTGGCA ACCTGGATAA GCGCCTATCT 
-510 TCTAATTAAA ACATTCAGAC GGGGCG6GGG ATGCGGTGGC CAAAGCACCA 
TAAAACAAAA CTTCCAAGTA CTGACCAACT CACTGCAAGT TTGTGCCCCG 
AGTACATCTA GGTTCAGGGG TTCTTGTCTT CATGCTCCCA ACTGCGGGCG 
GATTTTTGGT CCCTTGGGAC TTTCAGTGCA GCGGCGAAGA GAGTTCTGCA 
CTTGCAGGCT CCTAATGAGG GCGCAGTGGG CCTCGTGTTT CTGGTGATGC 
TTCCCAGGTT GCTGGGGGCA GCAAGTGTCT CAGAGCCCAT TACTGGCTAC 
ATTTTACTTC CACC\GAAAC CGAGCTGCGT CCAGATTTGC TCTCAGATGC 

Figur 1 c ntinued 
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GACTTGCCGC CC3GC?.CAGT TCCGGGGTAG TGGGGGAGTG GGCGTGGGAA 
ACCGGGAAAC CCAAACCTGG TATCCAGTGG GGGGCGTGGC CGGACGCAGG 
GAGTCCCC^C CCCTCCCGGT AATGACCCCG CCCCCATTCG CTAGTGTGTA 
+1 (transcriatitih start} 
-10 GCCGGCGCTC TCTT7CTGCC CTGAGTCCTC AGGACCCCAA GAGAGTAAGC 
TGTGTTTCCT TAGATCGCGC GGACCGCTAC CCGGCAGGAC TGAAAGCCCA 
GACTGTGTCC CGCAGCCGGG ATAACCTGGC TGACCCGATT CCGCGGACAC 
CGCTGCAGCC GCGGCTGGAG CCAGGGCGCC GGTGCCCCGC GCTCTCCCCG 
GTCTTGCGCT GCGGGGGCGC ATACCGCCTC TGTGACTTCT TTGCGGGCCA 

VRE 

GGGACGGAGA AGGAGTCTGT GCCTGAGA AC TGGGCTCTGT GCC CAGCGCG 
AGGTGCAG GA TSG AOAGCAA GGCGCTGCTA GCTCXCGCTC TCTGGTTCTG 
CGTGGAGACC CGAGCCGCCT CICTGCGTAA GAAGCCCACT CTTTAGTAGT 
AAGGCGGAGA AGTAGGGTGC GGGCGGAGAG TGGGAATAGA AGAGGACCTA 
ACTCGTAGAG CTCTAGAGAC CCTCCTCCCT TGGGTGTTCT TTCACTTACC 
+490 AATGGGGAAA CTGAGGTTCA AAGACTCTTC CGAAATGACT CAGCCAGGAT 
TCTACTCTCC CCCGGGCATC GGTTGGAGCG TGTCCTGCGG AGCCGTCACA 
GCCCCTGGCG CTAGGTAGGC AGGAGTGGAA AGGCGGCCTG AGCCGGGGCA 
GGAGATGCTC CCACTGGCAG GAACAGGCGG TCAAACGCTG GGAAGCCAGC 
TCAAGCCAAG CGGCCCGGCT GGCATCAATC ACTCGGTGCT GTTGCCCACC 
GCCCTAGTGG GGGGCAGGGA ATCCGCCTCT GGCTCCGCTC CCCTTTAGCT 
CCAGCGTGTA AGCGCACGGA CTATGTGAGG GTAGGTCTCT TCATAGAGCA 
ACACTTTCCT CCCTCAACTT TCTTTGATGC AGAATGCTAT TTTTGCTGGT 
AGGAGGAAGA CGCGGCTTTC TCTTCTGTGA CAGCTTCTCC AGGTGTATTA 
AACTAAATAA CTC7CCACTT ACCGACTCCA AAGCGCTGGT CCTGGGGTAA 
+990 ACTCTGAAAG TCTCAGAAAC TCTTGAGCTT GGCACCTAGT TATAGGTCAC 
TTTTCTTGTT TTAAAATGCC CTCTGCTTCA AGGTTAGGCC CACACTCGCT 

Flgur 1 continued 
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CTTGGGCTTT TGT3C=:ATAA TTTCCCTTCC CTTCCCTTCC CTTCCCTTCC 
CTTCCCTTCC CTTCCCTTCC CTTCCCTTCC CTTCCCTTTC CCTCTTCCTT 
TTCCTCCTCC TCTTCCTCCT CTATTTCTCT GTCATTTCCT TTITGAAGCC 
ACAGTTTGCA GATTTCCAAT CTCCACCCAT TGGAGAATGG AGAATCAGGA 
AAAAAGAAGT C^ATTCTGCA GAAP.CATTCC TTGCGCCCTA AGAGAATCGC 
ATGGCTTAAA AGCATTGGCA CTGACATACG GCGCCAAGAT CGCCTGTCTA 
GAGCTATTGA GTTTTCCTCA TAATGACTTG GTTCATCAGG CTAGCTCCAC 
CACGAGTGCC CTCTTGTTCC TGAGAAGGCC GCACTCTCCC CCTTTCTGGG 
AAGAGAAAGA CAGCCTGGAA CATGTGCTTG CCCTGGGTTC CATAGAGAAG 
CAAGTTGCtT TAAAGCCCAG AGAATTCCTA GTGTAGCAGC TTAACAGCGT 
CCCGTTCTCT GAATAAGATG GAGGTTGCCC TT7TGGAGTG TGTGACTTGC 

Xhol (+1600) 

TTAATTGGAT TGGGCTATAA TTGGTGCCAT CCAAGT CTCG AGA CAGAGCC 
GCTGTTGTTT TTCCTTCTGG TCTTTGAGCG GGAAGGATAA CAGTGCACAA 
ATTAATTAAT GTTGGTTATC GGATTTGAAC ATAAAAGGGC TTTTATTGTA 
TAGTAGCATA TGTACCTCTT GCAGTCAjGAA TGAGCTGTCT AAAGAACAGA 
ACCCAAACTT GCCGATGAAA ATGAATGAGG TTTAATAAAG GCGATGGATG 
AGCATTAGTC ACTGATGTAA ATCTCCAGTT AT TGATAAC C TCATTGACTG 
GATTTGATTG CAGACATGTA TTGGTATGGG GCATCCTTTA AAGATGAGCA 
+ 1990 TAGCCAACGT GCCTGCACTC TAAGAGAATC TATGGCTGTA TGTTATTACA 
GAGACAGTTG AGAAGCTCTT AGTGGCTCTG GCGTGTAGAT CAGCGGTAGA 
GCGCTGAGGC TCTGCGCTCG CTTCCTGGCA CTGAAGAATA AAGGCCATTT 
ACTGTGGTGG TGCAGTGGGC GCAGTTTGTG ACGAGTTACT ACTACATTTT 
CCTCACACAT CTGCC TGACT AA TGAGTTCA TCAGATGAGC GTATCCAGTG 
ATTGTTTGCA GGTTAATGGT TCTCAGTCAT GTTTAGAATC TACTTATCAA 
ACAAATTGTT TTCTCATTTC CTGCTTCTTC TCAAACAAAG TAAGATTCCA 
TTATTGAAAG GCTTGTTTAA GAGCATTTTA ACTGCTTGCC TATGTTAGGG 

Figure 1 continu d 
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ACAGTGACTT ATrTCiTATT GACAPJ^TATT ATGCCGATTA ATTGPJ^TATG 
ACTACCCAGT TCTATAGCTG TCTCA.GGGCA GACCAAGAGC A7CTGTGATC 
CAGTCAC7TT AAATGCCATT TAAAATGCAT AATTTGT7GG TCTAGGAATA 
AACACACTGT AAAGTTTAGA ATCKcGGCCC AAACACAAGT CTTTAACAAT 
GCCAACTAGC TTCTGAGATT CATTAATGTC ATTTAATTAC CAATGTTTTA 
AAAATATGTC ATTAATTACT AAATCTATAG TTGTAACAGC AACACATGTA 
CATCTTATTA AGTTGGGTAT ATTCAGGGTG GCATAGCTGT AGACTATTGC 
ACATCTGTGT TGGTGAGCCA GTGGAGAACT GCCTCCTGGC TGTTCTCAGA 
AGGCCACAGT GTCACGGCAT TGGCTATTTG CCTTGGCTCT ttqctaatac 

tttattgaca tggcctcatc ttcgttcacg ttcacttatt tgcccaacaa 

cgtcaatgcc agctgaggcc ttaggagtca tctgttctta gtcagtgcga 

attagaaagc ct3gatgcct gcctgctatt aattagttat tcttctcttc 

+2990 tgagacagag tctcactgtg tggcccaggc tagtctcaaa cttgcggtcc 

atttgtctca ctcatcagaa tgctgggctt ccaggtgtgt gcaccacact 

aggtagctcg cgttttaagc taagagctgg aagatcctga tgtcctttac 

catggtgggc atgrtacagg ttagttgact gaaaactagt tatctcgctg 

tgtaatgacc tgcagtggta tgtatctctc aagatgcttt tttgcatttc 

aatcagttag gtaacaagtt cttaagtctc cagcttggta ttggcatgag 

ctcagagctt tgattaatga gttgggaccc cctagctatt gctcattaga 

cttacactat ttttagtttt gctctgagtt tatgaatatg catgtatgca 

tgaacttggg agatattttt cttccccaat tccttttcct ccatttaaat 

gtgctgtctt tagaagccac tgcctcagct tctgcagctc agataccaaa 

ggaagt ctgg tacacagc at gataaa agac aatgggacgg ggtcacagtg 

gctcccgtcc ctttcagggg tatggagacg agctgtagag agatgtctcc 

agggagtttt cattaatcag caatttagtc agatctgtgc atcctatgct 

ttacaagaaa tgtcagtggg cctgagatca tcagatggag gttcatcgqg 

tttcaatgtc ccgtatcctt ttgtaagacc ttgaagttgg caacgc agga 

Figure 1 c ntinu d 
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AAACAGGAAC TCCACCCTGG TGCCGTGAP.T TGCAGAGCTG TTGTGTTGGT 
TTGTGACCAT CT3CCCATTC TTCCTGTTA? GACAGAGCTT GTGAACTTTA 
ACTGGGACTG G33CAAAGTC AATCCCACCT TTATACAATG AATTGCTGAA 
GAGGCCTTTT AAAACTTGGA GTGtGCATTG TTTATGGAAG GGCTTTCCTA 
BamHI (+3900) 

TT GGATCCA A CTCTTTTCTA ATTTGTTTCT AGGTCTGCCT GGCGATTTTC 
+ 3990 TCCAXCCCCC CAAGCTCAGC ACACAGAAAG ACATACTGAC AATTTTGGCA 
AATACAACCC TlCAGATTAC TTGCAGGTAA GGATTCCTTT TTGAGCCAGC 
TTTCCTATGT GAAAGGACTC ATTGTTTACT GAGGTCACAA CAATTTCCAC 
TATTGCAGAA GTATAATAGT ATTGTTACAA TTGTTTATAA ATCATGAGAC 
TTCTAAGAAC CTATTTAATA ATGAAACAAT GGAAAAAGTC TTTTCAAACC 
TTTGTACTCT TTTGCTGAGC CGTTTTCAAC ATGCACAAAC ATATTACACA 
AATATAACAT ACACAGGAAC ACACATGAAT GCATGGGATG ATGTGCCTAA 
AACTAGCATG TAATTGATAT TCACAATTAT TGATAAATTA GTAAAGCAAA 
GGAATTCCTT ATGAATAGAG CTAAAATTCT ATCCATGTTC AAGTCACCCA 
GAATGGCTTC TGGACATTTT TTTTTTTAGC TGTTTTCTAC AAGTGAAATT 
CTGCCTGTAT TAGCAATTTA ATATCTAGCC AATAATATTC CTGACCATAT 
GTCCTGTTCA GACCATGACC TTCATAATCT GGCTTGATGT TCTGGGCTTC^ 
TTTCCCTCTT GCCAGCAAGA TGTCACGGTG TTGATGCTGG ATAAACTGAG 
AAACAGAAGT TTTTCGCAAG AAGAGGACCT TGAATTTTGC TTTTCCCCTG 

AGAGACAAGA AAGGAAACTT AGAGGAGGTG TAGCTGGGAG TGTGGTCATT 

CATGAAAGAC CTGTTTGCAG GGCAGTGTGT TTTGCTGGGG ACAGTAATGA 

GCCTAGATCG TAGTGCCATC CCAAGAGAGT GCTTGGTGGC AAAAAGAGCC 

CTAGCAGCTT GTGGCAGTTG CCTCATATTT GAAGAATACT AAGAGGTCCC 

CCGAATAACT CAGGGCTAGT GTTGATCATT GCATGTGGAG AGAATCCAAG 

CCTCCTATCT AGGGTCTACA AAAGTAACCA ATGCCCAGTC TTTGGGGGAA 

Figur 1 ntinued 
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